🤖 AI Summary
This study investigates the theoretical limits of noiseless kernel ridge regression (KRR) for high-fidelity scientific computing simulations. Addressing the classical saturation phenomenon—where convergence rates are jointly constrained by target function smoothness and eigenvalue decay of the kernel integral operator—the work makes three key contributions: (1) It establishes, for the first time, the “extra smoothness effect” in noiseless KRR and characterizes an adaptive saturation threshold; (2) it derives a noise-aware unified error bound that recovers the minimax-optimal rate—up to logarithmic factors—as noise vanishes; (3) it introduces a refined notion of degrees of freedom and rigorously characterizes the estimation error via Sobolev interpolation norms, spectral analysis of integral operators, and reproducing kernel Hilbert space (RKHS) theory. Comprehensive high-precision numerical experiments validate the predicted convergence rates and saturation boundaries, providing a rigorous theoretical foundation for optimal KRR deployment in deterministic modeling.
📝 Abstract
Kernel ridge regression (KRR), also known as the least-squares support vector machine, is a fundamental method for learning functions from finite samples. While most existing analyses focus on the noisy setting with constant-level label noise, we present a comprehensive study of KRR in the noiseless regime -- a critical setting in scientific computing where data are often generated via high-fidelity numerical simulations. We establish that, up to logarithmic factors, noiseless KRR achieves minimax optimal convergence rates, jointly determined by the eigenvalue decay of the associated integral operator and the target function's smoothness. These rates are derived under Sobolev-type interpolation norms, with the $L^2$ norm as a special case. Notably, we uncover two key phenomena: an extra-smoothness effect, where the KRR solution exhibits higher smoothness than typical functions in the native reproducing kernel Hilbert space (RKHS), and a saturation effect, where the KRR's adaptivity to the target function's smoothness plateaus beyond a certain level. Leveraging these insights, we also derive a novel error bound for noisy KRR that is noise-level aware and achieves minimax optimality in both noiseless and noisy regimes. As a key technical contribution, we introduce a refined notion of degrees of freedom, which we believe has broader applicability in the analysis of kernel methods. Extensive numerical experiments validate our theoretical results and provide insights beyond existing theory.