Overfitting Regimes of Nadaraya-Watson Interpolators

📅 2025-02-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work investigates the generalization behavior of the Nadaraya–Watson (NW) estimator under noisy data, revealing a bandwidth-dependent non-monotonic overfitting spectrum: catastrophic → benign → mild. Methodologically, we employ nonparametric kernel regression analysis, asymptotic statistical derivations, and bandwidth sensitivity modeling to rigorously characterize the phase transitions governing overfitting regimes. We establish, for the first time, precise theoretical boundary conditions distinguishing these three overfitting phases. Comprehensive numerical experiments fully reproduce and validate all theoretical predictions. Our results challenge the conventional wisdom that overfitting inevitably harms generalization, formally establishing the “benign overfitting” paradigm within interpolation learning. By providing an analytically tractable benchmark, this work offers novel insights into the generalization mechanisms of modern overparameterized models.

Technology Category

Application Category

📝 Abstract
In recent years, there has been much interest in understanding the generalization behavior of interpolating predictors, which overfit on noisy training data. Whereas standard analyses are concerned with whether a method is consistent or not, recent observations have shown that even inconsistent predictors can generalize well. In this work, we revisit the classic interpolating Nadaraya-Watson (NW) estimator (also known as Shepard's method), and study its generalization capabilities through this modern viewpoint. In particular, by varying a single bandwidth-like hyperparameter, we prove the existence of multiple overfitting behaviors, ranging non-monotonically from catastrophic, through benign, to tempered. Our results highlight how even classical interpolating methods can exhibit intricate generalization behaviors. Numerical experiments complement our theory, demonstrating the same phenomena.
Problem

Research questions and friction points this paper is trying to address.

Analyze overfitting in Nadaraya-Watson interpolators
Study generalization across varying hyperparameters
Identify multiple overfitting regimes in classical methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Nadaraya-Watson estimator usage
Bandwidth hyperparameter variation
Multiple overfitting behaviors analysis
🔎 Similar Papers
No similar papers found.