🤖 AI Summary
This paper addresses causal inference when only a small number of units—or clusters—receive treatment, a setting where standard asymptotic inference fails: even with large overall sample sizes, confidence intervals exhibit undercoverage and hypothesis tests suffer from low power. To resolve this, we develop a unified finite-sample inference framework that integrates exact inference, randomization tests, doubly robust estimation, synthetic control, and panel fixed-effects methods—emphasizing non-asymptotic theoretical foundations and resampling techniques (e.g., permutation and residual bootstrap). Our key contributions are threefold: (i) providing rigorous finite-sample justification for widely used heuristic procedures; (ii) systematically clarifying the distinct inferential logic for cross-sectional versus panel data with few treated units; and (iii) proposing improved confidence intervals and testing procedures. Simulation and empirical results demonstrate substantial gains in coverage accuracy and statistical reliability over conventional asymptotic approaches.
📝 Abstract
In many causal inference applications, only one or a few units (or clusters of units) are treated. An important challenge in such settings is that standard inference methods that rely on asymptotic theory may be unreliable, even when the total number of units is large. This survey reviews and categorizes inference methods that are designed to accommodate few treated units, considering both cross-sectional and panel data methods. We discuss trade-offs and connections between different approaches. In doing so, we propose slight modifications to improve the finite-sample validity of some methods, and we also provide theoretical justifications for existing heuristic approaches that have been proposed in the literature.