🤖 AI Summary
This paper addresses the joint estimation of change-points—both locations and magnitudes—in multidimensional regression surfaces. We propose Free Discontinuity Regression (FDR), a nonparametric method that simultaneously achieves surface smoothing, domain segmentation, and precise recovery of jump parameters. Our key contributions are: (i) the first unified identifiability and uniform consistency guarantees for multivariate jump surfaces; (ii) relaxation of classical image segmentation assumptions—namely, fixed grids and i.i.d. noise—enabling flexible spatial sampling and robust handling of correlated noise; and (iii) theoretical consistency established under Special Bounded Variation (SBV) regularity, integrating convex relaxation of the Mumford–Shah functional, stochastic spatial sampling modeling, correlation-robust estimation, and SURE-driven automatic hyperparameter selection. Extensive 3D large-scale simulations validate theoretical performance. Empirical analysis of India’s internet shutdowns accurately identifies administrative boundaries and estimates a 25–35% decline in economic activity—outperforming state-of-the-art methods.
📝 Abstract
Sharp, multidimensional changepoints-abrupt shifts in a regression surface whose locations and magnitudes are unknown-arise in settings as varied as gene-expression profiling, financial covariance breaks, climate-regime detection, and urban socioeconomic mapping. Despite their prevalence, there are no current approaches that jointly estimate the location and size of the discontinuity set in a one-shot approach with statistical guarantees. We therefore introduce Free Discontinuity Regression (FDR), a fully nonparametric estimator that simultaneously (i) smooths a regression surface, (ii) segments it into contiguous regions, and (iii) provably recovers the precise locations and sizes of its jumps. By extending a convex relaxation of the Mumford-Shah functional to random spatial sampling and correlated noise, FDR overcomes the fixed-grid and i.i.d. noise assumptions of classical image-segmentation approaches, thus enabling its application to real-world data of any dimension. This yields the first identification and uniform consistency results for multivariate jump surfaces: under mild SBV regularity, the estimated function, its discontinuity set, and all jump sizes converge to their true population counterparts. Hyperparameters are selected automatically from the data using Stein's Unbiased Risk Estimate, and large-scale simulations up to three dimensions validate the theoretical results and demonstrate good finite-sample performance. Applying FDR to an internet shutdown in India reveals a 25-35% reduction in economic activity around the estimated shutdown boundaries-much larger than previous estimates. By unifying smoothing, segmentation, and effect-size recovery in a general statistical setting, FDR turns free-discontinuity ideas into a practical tool with formal guarantees for modern multivariate data.