Five disturbing impossibility theorems
[stat_ml
]
I remember reading, maybe in a decades-ago issue of Scientific American, that geometry and physics professors sometimes hear from cranks. These are arrogant people, working in isolation, who earnestly believe they have discovered something important. Typical examples might be trisecting an angle with a compass and straightedge in a finite number of steps or unifying physics. I have not understood the details of those examples personally, but there is a basic, recurring disconnect between the position of the crank – “The establishment has tried and failed; they do not understand” – and the position of the establishment – “We understand thoroughly; this task is proven to be impossible.”
In order to avoid wasting time on the wrong side of this dichotomy, the progressive scientist naturally wants to be acquainted with any impossibility theorems that affect their studies. In my field, statistics, this is hard. Except for certain classics like the cramer-rao lower bound, our impossibility theorems are not often taught, and some of them are genuinely surprising. In particular, one would hope that just about any statistical model could be learned correctly in principle. With no technological barriers, no ethical objections, and no limits on funding or resources, or simply with a computer simulation as the data generating mechanism, one would hope that cranking out more and more data would lead to inferences that are eventually correct, except in obvious cases like a regression with multiple exactly identical predictors.
One would be disappointed. Beyond that, one would be disturbed by the number of basic and fundamental goals that are now known to be out of reach. And not just out of reach given present techniques or datasets: provably, irretrievably out of reach given any technique and any experimental design. Some examples:
- You can’t achieve optimal inference and optimal prediction simultaneously.
- You can’t satisfy three simple desiderata for a clustering algorithm.
- You can’t estimate the mean.
- You can’t test for independence conditional on a quantitative random variable.
- You can’t construct confidence intervals near a region of non-identifiability.