Estimating Continuous Treatment Effects with Two-Stage Kernel Ridge Regression

📅 2026-04-14
📈 Citations: 0
Influential: 0
📄 PDF

career value

184K/year
🤖 AI Summary
This study addresses selection bias induced by covariates in the estimation of continuous treatment effects by proposing a two-stage kernel ridge regression approach. In the first stage, the method models the joint dependence of the outcome on both the treatment variable and covariates; in the second stage, it constructs pseudo-outcomes that correct for distributional shifts to estimate the average treatment effect. The core innovation lies in an estimator that adaptively exploits potential structural simplifications in the treatment effect function, coupled with a fully data-driven model selection procedure that requires no prior knowledge. This framework simultaneously adapts to unknown degrees of overlap and kernel eigenvalue decay rates, achieving theoretical guarantees for adaptivity to both overlap conditions and function smoothness, thereby yielding more accurate estimates of continuous treatment effects.

Technology Category

Application Category

📝 Abstract
We study the problem of estimating the effect function for a continuous treatment, which maps each treatment value to a population-averaged outcome. A central challenge in this setting is confounding: treatment assignment often depends on covariates, creating selection bias that makes direct regression of the response on treatment unreliable. To address this issue, we propose a two-stage kernel ridge regression method. In the first stage, we learn a model for the response as a function of both treatment and covariates; in the second stage, we use this model to construct pseudo-outcomes that correct for distribution shift, and then fit a second model to estimate the treatment effect. Although the response varies with both treatment and covariates, the induced effect function obtained by averaging over covariates is typically much simpler, and our estimator adapts to this structure. Furthermore, we introduce a fully data-driven model selection procedure that achieves provable adaptivity to both the unknown degree of overlap and the regularity (eigenvalue decay) of the underlying kernel.
Problem

Research questions and friction points this paper is trying to address.

continuous treatment effects
confounding
causal inference
selection bias
effect function
Innovation

Methods, ideas, or system contributions that make the work stand out.

continuous treatment effect
two-stage kernel ridge regression
confounding adjustment
data-driven model selection
distribution shift correction
🔎 Similar Papers