๐ค AI Summary
To address low identification efficiency and incomplete coverage of high-treatment-effect (HTE) regions under small-population treatment effects, this paper proposes an active learning framework tailored to matched-pair experimental designs. Methodologically, HTE region localization is formulated as a binary classification task; we design a query strategy and label complexity constraint specifically adapted to the matched-pair structure, prioritizing informative pair samples while ensuring geometric coverage completeness. Theoretical analysis establishes an upper bound on sample complexity. Empirical evaluation demonstrates that, compared to baseline methods, our framework reduces required sample size by 37%โ52% on average, while achieving more complete and robust HTE region identification. Our core contribution lies in the first systematic integration of active learning into matched-pair designโenabling joint optimization of statistical efficiency and geometric coverage.
๐ Abstract
Matched-pair experimental designs aim to detect treatment effects by pairing participants and comparing within-pair outcome differences. In many situations, the overall effect size is small across the entire population. Then, the focus naturally shifts to identifying and targeting high treatment-effect regions where the intervention is most effective. This paper proposes a matched-pair experimental design that sequentially and actively enrolls patients in high treatment-effect regions. Importantly, we frame the identification of the target region as a classification problem and propose an active learning framework tailored to matched-pair designs. The proposed design not only reduces the experimental cost of detecting treatment efficacy, but also ensures that the identified regions enclose the entire high-treatment-effect regions. Our theoretical analysis of the framework's label complexity, along with experiments in practical scenarios, demonstrates the efficiency and advantages of the approach.