Design-based theory for causal inference

📅 2025-11-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Causal inference confronts practical challenges including high-dimensional covariates, individual noncompliance, and network interference; conventional model-based approaches often rely on strong parametric or functional-form assumptions. This paper systematically reviews design-driven advances in causal inference, focusing on covariate-balancing designs (e.g., stratified randomization, rerandomization), regression adjustment, and network-aware inference—grounded in the Fisher randomization test and Neyman asymptotic frameworks. Its key contribution lies in developing a robust, model-agnostic framework for randomization design and inference under weak assumptions, substantially enhancing identification power and statistical efficiency in high-dimensional, noncompliant, and network-interfered settings. The proposed methodology provides theoretically rigorous yet practically implementable tools for causal analysis in medicine, social sciences, and related domains. It further identifies promising future directions, including multi-scenario integration and co-design of algorithms and experimental protocols.

Technology Category

Application Category

📝 Abstract
Causal inference, as a major research area in statistics and data science, plays a central role across diverse fields such as medicine, economics, education, and the social sciences. Design-based causal inference begins with randomized experiments and emphasizes conducting statistical inference by leveraging the known randomization mechanism, thereby enabling identification and estimation of causal effects under weak model dependence. Grounded in the seminal works of Fisher and Neyman, this paradigm has evolved to include various design strategies, such as stratified randomization and rerandomization, and analytical methods including Fisher randomization tests, Neyman-style asymptotic inference, and regression adjustment. In recent years, with the emergence of complex settings involving high-dimensional data, individual noncompliance, and network interference, design-based causal inference has witnessed remarkable theoretical and methodological advances. This paper provides a systematic review of recent progress in this field, focusing on covariate-balanced randomization designs, design-based statistical inference methods, and their extensions to high-dimensional, noncompliance, and network interference scenarios. It concludes with a comprehensive perspective on future directions for the theoretical development and practical applications of causal inference.
Problem

Research questions and friction points this paper is trying to address.

Reviews recent advances in design-based causal inference methods
Focuses on covariate-balanced randomization and statistical inference techniques
Extends methods to high-dimensional, noncompliance, and network interference scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Design-based inference using known randomization mechanisms
Covariate-balanced randomization designs for complex settings
Extensions to high-dimensional, noncompliance, and network scenarios
🔎 Similar Papers
No similar papers found.
X
Xin Lu
清华大学统计与数据科学系,北京, 100084
W
Wanjia Fu
清华大学统计与数据科学系,北京, 100084
H
Hongzi Li
清华大学统计与数据科学系,北京, 100084
H
Haoyang Yu
清华大学统计与数据科学系,北京, 100084
H
Honghao Zhang
清华大学统计与数据科学系,北京, 100084
K
Ke Zhu
北卡罗来纳州立大学统计系,罗利, NC 27695,美国;杜克大学生物统计与生物信息学系,达勒姆, NC 27710,美国
Hanzhong Liu
Hanzhong Liu
Tsinghua University
high dimensional statisticscausal inference