Conformalized Robust Principal Component Analysis

📅 2026-03-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of uncertainty quantification for recovered matrix entries in robust principal component analysis (RPCA) by proposing CP-RPCA, a novel framework that introduces conformal prediction to RPCA for the first time. CP-RPCA provides finite-sample coverage guarantees without requiring distributional assumptions and incorporates a weighted calibration mechanism to accommodate heterogeneous observation probabilities. The method supports both split and full conformal implementations and maintains robustness in the presence of missing data, outliers, or model misspecification. Experimental results demonstrate that CP-RPCA yields reliable and informative prediction intervals across diverse challenging scenarios while retaining high efficiency under correctly specified models, thereby confirming its scalability and practical utility.

Technology Category

Application Category

📝 Abstract
Robust principal component analysis (RPCA) is a widely used technique for recovering low-rank structure from matrices with missing entries and sparse, possibly large-magnitude corruptions. Although numerous algorithms achieve accurate point estimation, they offer little guidance on the uncertainty of recovered entries, limiting their reliability in practice. In this paper, we propose conformal prediction-RPCA (CP-RPCA), a practical and distribution-free framework for uncertainty quantification in robust matrix recovery. Our proposed method supports both split and full conformal implementations and incorporates weighted calibration to handle heterogeneous observation probabilities. We provide theoretical guarantees for finite-sample coverage and demonstrate through extensive simulations that CP-RPCA delivers reliable uncertainty quantification under severe outliers, missing data and model misspecification. Empirical results show that CP-RPCA can produce informative intervals and remain competitive in efficiency when the RPCA model is well specified, making it a scalable and robust tool for uncertainty-aware matrix analysis.
Problem

Research questions and friction points this paper is trying to address.

Robust Principal Component Analysis
Uncertainty Quantification
Matrix Recovery
Conformal Prediction
Missing Data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Conformal Prediction
Robust PCA
Uncertainty Quantification
Matrix Completion
Distribution-Free Inference
L
Liangliang Yuan
School of Mathematics and Statistics, Nanjing University of Science and Technology, China
L
Lei Wang
School of Statistics and Data Science, KLMDASR, LEBPS and LPMC, Nankai University, China
Q
Quan Kong
School of Cyber Science and Engineering, Nanjing University of Science and Technology, China
Liuhua Peng
Liuhua Peng
the University of Melbourne
Statistics