Home
Results for: Simpson's paradox
Statistics (1 of 5 sources) Open/Close data Source
Simpson's paradox

An intriguing paradox illustrating how one may be misled when a relevant variable is overlooked. The paradox is illustrated in the following example, which shows a cross-classification of three dichotomous variables, A, B, and C (where, for example, A1 and A2 are the two categories of A):

C1C2
B1B2TotalB1B2TotalB1B2Total
A1 95800 895+A1400 5 405=A1495 8051 300
A2  5100 105A2400195 595A2405 295 700
Total1009001 000Total8002001 000Total9001 1002 000

In the subpopulation corresponding to C1, there is a strong positive association between A and B. The same is true for the subpopulation corresponding to C2. However, when the information on these two very dissimilar subpopulations is pooled, the association for the entire population is strongly negative. See ecological fallacy for a diagram that shows how this type of result can occur with continuous data.





Philosophy Open/Close data Source
Wikipedia Open/Close data Source
Best of Web Open/Close data Source
Mentioned In Open/Close data Source