🤖 AI Summary
Existing methods struggle to detect localized differences in treatment effects at the distributional level—such as in tails, dispersion, or rare events—and global tests often lack interpretability. This work proposes DR-ME, the first semi-parametrically efficient and finitely interpretable test for distributional treatment effects. By integrating orthogonal doubly robust kernel embeddings, covariance whitening, and sample splitting, DR-ME evaluates intervention kernel witness functions at learned outcome locations, achieving locally optimal signal-to-noise ratios while preserving post-selection validity. Empirical results demonstrate that DR-ME accurately controls Type I error rates and exhibits superior power compared to existing global kernel-based methods. In a medical imaging application, it successfully identifies clinically meaningful localized distributional effects of treatment.
📝 Abstract
Distributional treatment effects can be invisible to means: a treatment may preserve average outcomes while changing tails, modes, dispersion, or rare-event probabilities. Kernel tests can detect discrepancies between interventional outcome laws, but global tests do not reveal where the laws differ. We propose DR-ME, to our knowledge the first semiparametrically efficient finite-location test for interpretable distributional treatment effects. DR-ME evaluates an interventional kernel witness at learned outcome locations, returning causal-discrepancy coordinates rather than only a global rejection. From observational data, we derive orthogonal doubly robust kernel features whose centered oracle form is the canonical gradient of this finite witness. For fixed locations, we characterize the local testing limit: DR-ME is chi-square calibrated under the null, has noncentral chi-square local power, and uses the covariance whitening that optimizes local signal-to-noise for discrepancies visible through the selected coordinates. This efficient local-power geometry yields a principled location-learning criterion, with sample splitting preserving post-selection validity. Experiments show near-nominal type-I error, competitive power against global doubly robust kernel tests, and interpretable learned locations that localize distributional effects in a semi-synthetic medical-imaging study.