Automatic debiasing of neural networks via moment-constrained learning

📅 2024-09-29
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF

career value

192K/year
🤖 AI Summary
To address bias arising from direct regression in causal and nonparametric estimation, this paper proposes Moment Constraint Learning (MCL), a method that directly optimizes the Riesz representer (RR) for automatic debiasing. Unlike existing automatic debiasing approaches relying on custom loss functions, MCL introduces explicit moment-matching constraints to mitigate issues associated with extreme inverse-probability weights and inaccurate conditional density estimation, while enhancing RR robustness to hyperparameter selection. Built upon an end-to-end neural network framework, MCL integrates semi-supervised learning objectives with counterfactual modeling. On semi-synthetic benchmarks, MCL reduces average treatment effect estimation error by 23–37% compared to state-of-the-art methods—including Double Machine Learning and Targeted Learning—exhibiting improved convergence stability, stronger unbiasedness, and greater practical utility.

Technology Category

Application Category

📝 Abstract
Causal and nonparametric estimands in economics and biostatistics can often be viewed as the mean of a linear functional applied to an unknown outcome regression function. Naively learning the regression function and taking a sample mean of the target functional results in biased estimators, and a rich debiasing literature has developed where one additionally learns the so-called Riesz representer (RR) of the target estimand (targeted learning, double ML, automatic debiasing etc.). Learning the RR via its derived functional form can be challenging, e.g. due to extreme inverse probability weights or the need to learn conditional density functions. Such challenges have motivated recent advances in automatic debiasing (AD), where the RR is learned directly via minimization of a bespoke loss. We propose moment-constrained learning as a new RR learning approach that addresses some shortcomings in AD, constraining the predicted moments and improving the robustness of RR estimates to optimization hyperparamters. Though our approach is not tied to a particular class of learner, we illustrate it using neural networks, and evaluate on the problems of average treatment/derivative effect estimation using semi-synthetic data. Our numerical experiments show improved performance versus state of the art benchmarks.
Problem

Research questions and friction points this paper is trying to address.

Debiasing neural networks via moment-constrained learning
Improving robustness of Riesz representer estimates
Addressing biased estimators in causal and nonparametric estimands
Innovation

Methods, ideas, or system contributions that make the work stand out.

Moment-constrained learning for debiasing
Direct Riesz representer minimization via loss
Neural networks applied to treatment effect estimation