Transforming Conditional Density Estimation Into a Single Nonparametric Regression Task

📅 2025-11-23
📈 Citations: 0
✹ Influential: 0
📄 PDF
đŸ€– AI Summary
This paper addresses the challenge of applying powerful regression models directly to high-dimensional conditional density estimation (CDE). We propose a novel framework that reformulates CDE as a single nonparametric regression task by constructing labeled auxiliary samples, thereby mapping density estimation into a standard regression problem. Our approach imposes no parametric assumptions on the conditional density and enables plug-and-play integration of arbitrary state-of-the-art regressors—including deep neural networks and gradient-boosted trees. We establish theoretical consistency: the proposed estimator converges almost surely to the true conditional density as the sample size tends to infinity. Extensive experiments on synthetic data, U.S. Census records, and satellite imagery demonstrate that our method significantly outperforms existing CDE benchmarks across diverse real-world scenarios. Moreover, the results align with domain-specific prior knowledge, underscoring the method’s balance of theoretical rigor and practical efficacy.

Technology Category

Application Category

📝 Abstract
We propose a way of transforming the problem of conditional density estimation into a single nonparametric regression task via the introduction of auxiliary samples. This allows leveraging regression methods that work well in high dimensions, such as neural networks and decision trees. Our main theoretical result characterizes and establishes the convergence of our estimator to the true conditional density in the data limit. We develop condensité, a method that implements this approach. We demonstrate the benefit of the auxiliary samples on synthetic data and showcase that condensité can achieve good out-of-the-box results. We evaluate our method on a large population survey dataset and on a satellite imaging dataset. In both cases, we find that condensité matches or outperforms the state of the art and yields conditional densities in line with established findings in the literature on each dataset. Our contribution opens up new possibilities for regression-based conditional density estimation and the empirical results indicate strong promise for applied research.
Problem

Research questions and friction points this paper is trying to address.

Transforms conditional density estimation into nonparametric regression
Leverages neural networks for high-dimensional density estimation
Achieves state-of-the-art performance on survey and satellite datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transforms density estimation into regression task
Uses auxiliary samples for nonparametric regression
Leverages neural networks and decision trees
A
Alexander G. Reisach
CNRS, MAP5, Université Paris Cité, F-75006 Paris, France
O
Olivier Collier
MODAL’X, UPL, UniversitĂ© Paris Nanterre, F-92000 Nanterre, France
Alex Luedtke
Alex Luedtke
Member of the Faculty, Harvard University
causal inferencemachine learningsemiparametricsautomated estimation
Antoine Chambaz
Antoine Chambaz
Professor of Statistics, MAP5 (UMR CNRS 8145), Université de Paris
Semiparametricsreinforcement learningbiostatistics