201 – Reasoning with probabilities
I’m reading a very interesting book called Reckoning With Risk: Learning to Live With Uncertainty by Gerd Gigerenzer (holiday reading). He is a psychologist who specialises in the way that risk is communicated and perceived, or often mis-perceived.
One of his interesting points is that, even when well-educated people (like doctors, for example) have accurate knowledge of probabilities related to a risk, they are often incapable of manipulating those probabilities in fairly simple ways to make sound judgements about the risk.
Here is a striking example. He asked 24 doctors (with an average of 14 years' experience) the following question.
The probability that a randomly chosen woman has breast cancer is 0.8 percent. If a woman who has breast cancer undergoes a mammogram, the probability is 90 percent that the mammogram result will be positive (i.e. indicating that she does have breast cancer). If a woman who does not have breast cancer undergoes a mammogram, the probability is 7 percent that she will have a positive mammogram. Imagine a woman who has a positive mammogram. What is the probability that she actually has breast cancer?
See if you can work out the answer, at least approximately. People who are used to working with statistics and probabilities – like, say, academic economists – should find it easy. (It’s an application of Bayes’ law (see appendix), but you don’t need to know that to solve it.)
Not surprisingly, most people find it very difficult to untangle information like this that includes conditional probabilities, even doctors whose job it is to advise people about these sorts of things.
Of the 24 doctors who responded to the question, only four got close. Most were wildly inaccurate, giving probabilities that were far too high. It seems they had no idea how to tackle the question. The most common answer, by far, was 90 percent. But that obviously can’t be right: 90 percent is the probability of a positive mammogram given cancer, not the probability of cancer given a positive mammogram, and the two are only close when cancer is common, whereas 99.2 percent of women don’t have it!
This lack of probabilistic reasoning among doctors is a pretty concerning finding. You’d like to think that your doctors could provide sound advice about medical risks, if you wanted it, but in this case, at least, you’d be disappointed. It actually reinforces a feeling I’ve had a number of times that doctors who were advising me did not have a good understanding of probabilities.
Gigerenzer then did something quite clever. He reformulated the question in a way that made it much easier for people to think about, as follows.
Eight out of every 1000 women have breast cancer. Of these 8 women with breast cancer, 7 will have a positive mammogram. Of the remaining 992 women who don’t have breast cancer, 70 will still have a positive mammogram. Imagine a randomly selected sample of women who have positive mammograms. How many of these women actually have breast cancer? ____ out of ____.
This is the same information as in the earlier question (with some rounding). From this description, it is much easier to see that, of the 1000 women we are considering, 77 will have positive mammograms, and of those 77, only 7 will actually have cancer. Thus the answer is that 7/77 = 1/11 = 9% of women with a positive mammogram would actually have cancer.
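Just to check the arithmetic, here is a minimal sketch in Python (purely illustrative, not from the book) of the natural-frequency version of the calculation:

# Natural-frequency version of the mammogram problem (illustrative sketch)
women = 1000
with_cancer = 8                                # 8 in 1000 have breast cancer
without_cancer = women - with_cancer           # 992 do not
positive_if_cancer = 7                         # of the 8 with cancer, 7 test positive
positive_if_no_cancer = 70                     # of the 992 without cancer, 70 test positive
total_positive = positive_if_cancer + positive_if_no_cancer   # 77 positive mammograms
print(positive_if_cancer / total_positive)     # 0.0909..., i.e. about 9%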
Gigerenzer argues that this approach of reformulating probabilistic information into absolute numbers (i.e. it talks about 8 women with breast cancer, rather than 0.8% of women) will generally improve people’s understanding of risk. It certainly worked pretty well with another sample of 24 doctors. Almost half got the second version of the question exactly right, and most got it approximately right. Only 5 of the 24 were highly inaccurate.
That’s a big improvement, although I’m still worried that around 20% of doctors got the answer completely wrong even when the question was posed in such a transparent and intuitively obvious way. I hope they aren’t my doctors!
In follow-up research, Gigerenzer investigated what the doctors actually did when they were trying to answer these questions – which numbers from the problem description they used and how they combined them. For the first, more difficult description based on conditional probabilities, it became even clearer that the doctors didn’t have a clue what to do. Only 20% of them used the same reasoning strategy again when the question was repeated for a different medical test. Clearly, they were just guessing, and overwhelmingly guessing wrong.
Given the intuitive problem description, more doctors used the right reasoning strategy and applied it consistently, but there was still a disturbing number who didn’t.
It will certainly make me want to explore the probabilities more deeply with my doctor when I next have a medical diagnosis.
Appendix: Using Bayes' Theorem to solve the problem
Bayes' Theorem: P(a|b) = P(b|a)*P(a)/P(b)
where
a = has breast cancer
b = has positive mammogram
From the problem description:
P(a) = 0.008
P(b|a) = 0.9
P(b|not a) = 0.07
By the law of total probability:
P(b) = P(b|a)*P(a) + P(b|not a)*P(not a) = 0.9*0.008 + 0.07*0.992 = 0.077
So
P(a|b) = P(b|a)*P(a)/P(b) = 0.9*0.008/0.077 = 0.094
That is, if a randomly chosen woman has a positive mammogram, the probability that she has breast cancer is 9%. The intuitive approach, based on absolute frequencies, is clearly much easier to think about than this, although it amounts to the same thing in the end.
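For completeness, the same Bayes calculation can be written out in a few lines of Python (again purely illustrative), which makes it easy to confirm that the two framings give the same answer:

# Conditional-probability version of the mammogram problem (illustrative sketch)
p_cancer = 0.008               # P(a): prior probability of breast cancer
p_pos_given_cancer = 0.90      # P(b|a): probability of a positive mammogram given cancer
p_pos_given_no_cancer = 0.07   # P(b|not a): probability of a positive mammogram without cancer

# Law of total probability: P(b) = P(b|a)*P(a) + P(b|not a)*P(not a)
p_pos = p_pos_given_cancer * p_cancer + p_pos_given_no_cancer * (1 - p_cancer)

# Bayes' theorem: P(a|b) = P(b|a)*P(a)/P(b)
p_cancer_given_pos = p_pos_given_cancer * p_cancer / p_pos
print(round(p_cancer_given_pos, 3))            # 0.094, i.e. about 9%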
Further reading
Gigerenzer, G. (2002). Reckoning With Risk: Learning to Live With Uncertainty, Penguin, London.
Hi David
That is a very nice demonstration.
Hopefully there are further tests available, or are 9 out of 10 being treated for something they have not got?
Happy Christmas
Nigel Hall
The book describes some tragic cases for other diseases where statistical ignorance led to medical experts under-rating (or in fact completely discounting) the likelihood of a false positive test result. For example, a woman with a false positive HIV test result was advised that there was no possibility that the result was wrong. She went for 18 months believing she would develop AIDS, with big consequences for her mental and physical well-being. For example, thinking that there was nothing to lose, she had unprotected sex with an HIV-positive partner. In the case of breast cancer, however, my understanding is that follow-up tests are routinely conducted, so treating healthy people is unlikely.
Our probabilistic incompetence would be distressing enough if it were just confined to something as frivolous as medical diagnosis. But the exact same problem is commonly encountered in environmental management whenever we have imperfect detection: When can we declare successful eradication of a weed? What’s the chance a contagious disease is still present in an animal population, despite negative tests on a subset of individuals? If we allow development of a site, what’s the chance an endangered species is present even though biological survey records absence? In acquiring land for conservation reserves, how confident are we that species of interest are actually captured by the reserve network? The first three problems allude to the costs of false negatives. The last is more about the cost of false positives. In all cases, the same thinking encouraged by Gigerenzer (or Bayes' theorem) can be used, so long as we have pre-diagnosis estimates of prevalence (from a model, or sparse data, or expert opinion) and we’re able to hazard a guess at the reliability of our diagnostic ‘test’ (mammogram, blood test, biological survey).
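To make the parallel concrete, here is a small illustrative sketch in Python (my own hypothetical numbers, not real survey data) of the weed-eradication case: updating the probability that a weed is still present at a site after a run of surveys that fail to detect it, given an assumed prior probability of persistence and an assumed per-survey detection probability.

# Hypothetical example: is a weed still present after several negative surveys?
p_present = 0.5     # assumed prior probability the weed is still present
p_detect = 0.8      # assumed probability one survey detects the weed if it is present

# After each survey that fails to find the weed, update by Bayes' theorem:
# P(present | not detected) = P(not detected | present)*P(present) / P(not detected)
for survey in range(1, 6):
    p_not_detected = (1 - p_detect) * p_present + (1 - p_present)
    p_present = (1 - p_detect) * p_present / p_not_detected
    print(survey, round(p_present, 3))
# The estimated probability of persistence falls with each negative survey,
# but how quickly it falls depends entirely on the assumed detection probability.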
It’s a curious thing that epidemiologists, who have known how to approach and solve these problems for decades, have had limited success in getting medical practitioners and vets to think straight. It’d be nice if we could use an understanding of cognitive psychology to avoid similar problems in environmental management.