🤖 AI Summary
Traditional Robust Principal Component Analysis (RPCA) methods for sparse object segmentation suffer from high computational complexity, sensitivity to hyperparameters, and rigid priors, leading to poor adaptability in dynamic scenes. Method: This paper proposes an interpretable deep RPCA framework. It unrolls a relaxed RPCA model into a deep network comprising three modules—background approximation, sparse foreground extraction, and image reconstruction. A memory-augmented mechanism ensures temporal consistency of background estimates, while a deep contrastive prior module integrates saliency cues to accelerate foreground localization. Furthermore, a dual-path (visual and numerical) low-rank/sparse metric enhances discriminability. Contributions/Results: The method achieves state-of-the-art performance on multiple benchmark datasets, significantly outperforming existing approaches. It offers strong interpretability through modular design and demonstrates robustness across diverse imaging scenarios.
📝 Abstract
Robust principal component analysis (RPCA) decomposes an observation matrix into low-rank background and sparse object components. This capability has enabled its application in tasks ranging from image restoration to segmentation. However, traditional RPCA models suffer from computational burdens caused by matrix operations, reliance on finely tuned hyperparameters, and rigid priors that limit adaptability in dynamic scenarios. To solve these limitations, we propose RPCANet++, a sparse object segmentation framework that fuses the interpretability of RPCA with efficient deep architectures. Our approach unfolds a relaxed RPCA model into a structured network comprising a Background Approximation Module (BAM), an Object Extraction Module (OEM), and an Image Restoration Module (IRM). To mitigate inter-stage transmission loss in the BAM, we introduce a Memory-Augmented Module (MAM) to enhance background feature preservation, while a Deep Contrast Prior Module (DCPM) leverages saliency cues to expedite object extraction. Extensive experiments on diverse datasets demonstrate that RPCANet++ achieves state-of-the-art performance under various imaging scenarios. We further improve interpretability via visual and numerical low-rankness and sparsity measurements. By combining the theoretical strengths of RPCA with the efficiency of deep networks, our approach sets a new baseline for reliable and interpretable sparse object segmentation. Codes are available at our Project Webpage https://fengyiwu98.github.io/rpcanetx.