Automatic debiasing of neural networks via moment-constrained learning

📅 2024-09-29

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

192K/year

🤖 AI Summary

To address bias arising from direct regression in causal and nonparametric estimation, this paper proposes Moment Constraint Learning (MCL), a method that directly optimizes the Riesz representer (RR) for automatic debiasing. Unlike existing automatic debiasing approaches relying on custom loss functions, MCL introduces explicit moment-matching constraints to mitigate issues associated with extreme inverse-probability weights and inaccurate conditional density estimation, while enhancing RR robustness to hyperparameter selection. Built upon an end-to-end neural network framework, MCL integrates semi-supervised learning objectives with counterfactual modeling. On semi-synthetic benchmarks, MCL reduces average treatment effect estimation error by 23–37% compared to state-of-the-art methods—including Double Machine Learning and Targeted Learning—exhibiting improved convergence stability, stronger unbiasedness, and greater practical utility.

Technology Category

Application Category

📝 Abstract

Causal and nonparametric estimands in economics and biostatistics can often be viewed as the mean of a linear functional applied to an unknown outcome regression function. Naively learning the regression function and taking a sample mean of the target functional results in biased estimators, and a rich debiasing literature has developed where one additionally learns the so-called Riesz representer (RR) of the target estimand (targeted learning, double ML, automatic debiasing etc.). Learning the RR via its derived functional form can be challenging, e.g. due to extreme inverse probability weights or the need to learn conditional density functions. Such challenges have motivated recent advances in automatic debiasing (AD), where the RR is learned directly via minimization of a bespoke loss. We propose moment-constrained learning as a new RR learning approach that addresses some shortcomings in AD, constraining the predicted moments and improving the robustness of RR estimates to optimization hyperparamters. Though our approach is not tied to a particular class of learner, we illustrate it using neural networks, and evaluate on the problems of average treatment/derivative effect estimation using semi-synthetic data. Our numerical experiments show improved performance versus state of the art benchmarks.

Problem

Research questions and friction points this paper is trying to address.

Debiasing neural networks via moment-constrained learning

Improving robustness of Riesz representer estimates

Addressing biased estimators in causal and nonparametric estimands

Innovation

Methods, ideas, or system contributions that make the work stand out.

Moment-constrained learning for debiasing

Direct Riesz representer minimization via loss

Neural networks applied to treatment effect estimation

🔎 Similar Papers

From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

2024-02-18arXiv.orgCitations: 0

Instacart

CA, NY, CT, NJ$240,000—$253,500 USDWA$230,000—$243,000 USDOR, DE, ME, MA, MD, NH, RI, VT, DC, PA, VA, CO, TX, IL, HI$221,000—$233,000 USDAll other states$201,000—$212,000 USD

remote

Research Engineer, Monetization AI