Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

📅 2025-05-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the significant performance degradation of multi-view clustering under real-world noisy views, this paper proposes AIRMVC, an end-to-end robust framework that— for the first time—formulates noisy view identification as a Gaussian Mixture Model (GMM)-driven anomaly detection task. It introduces a hybrid correction strategy and a noise-robust contrastive learning mechanism, with theoretical analysis proving their effectiveness in filtering out noise-induced distortions. By unifying multi-view representation learning, self-supervised correction, and anomaly-aware fusion, AIRMVC achieves state-of-the-art performance across six benchmark datasets, improving average clustering accuracy by over 8% under high-noise conditions. The source code is publicly available.

Technology Category

Application Category

📝 Abstract
Leveraging the powerful representation learning capabilities, deep multi-view clustering methods have demonstrated reliable performance by effectively integrating multi-source information from diverse views in recent years. Most existing methods rely on the assumption of clean views. However, noise is pervasive in real-world scenarios, leading to a significant degradation in performance. To tackle this problem, we propose a novel multi-view clustering framework for the automatic identification and rectification of noisy data, termed AIRMVC. Specifically, we reformulate noisy identification as an anomaly identification problem using GMM. We then design a hybrid rectification strategy to mitigate the adverse effects of noisy data based on the identification results. Furthermore, we introduce a noise-robust contrastive mechanism to generate reliable representations. Additionally, we provide a theoretical proof demonstrating that these representations can discard noisy information, thereby improving the performance of downstream tasks. Extensive experiments on six benchmark datasets demonstrate that AIRMVC outperforms state-of-the-art algorithms in terms of robustness in noisy scenarios. The code of AIRMVC are available at https://github.com/xihongyang1999/AIRMVC on Github.
Problem

Research questions and friction points this paper is trying to address.

Automatically identify and rectify noisy data in multi-view clustering
Improve clustering performance in noisy real-world scenarios
Develop robust contrastive mechanisms for reliable representation learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

GMM-based noisy data identification
Hybrid rectification strategy for noise
Noise-robust contrastive representation mechanism
🔎 Similar Papers
No similar papers found.
Xihong Yang
Xihong Yang
NUDT & NUS
Graph Neural NetworkRecommender SystemMulti-modal/view Learning
Siwei Wang
Siwei Wang
National University of Defense Technology
Large-graph studymulti-view fusionmulti-view clustering
F
Fangdi Wang
School of Computer, National University of Defense Technology, Changsha, Hunan, China
J
Jiaqi Jin
School of Computer, National University of Defense Technology, Changsha, Hunan, China
Suyuan Liu
Suyuan Liu
National University of Defense Technology
Multi-view ClusteringAnchor LearningGraph Learning
Y
Yue Liu
National University of Singapore, Singapore
E
En Zhu
School of Computer, National University of Defense Technology, Changsha, Hunan, China
X
Xinwang Liu
School of Computer, National University of Defense Technology, Changsha, Hunan, China
Yueming Jin
Yueming Jin
Assistant Professor, National University of Singapore
Medical Image AnalysisSurgical AI&RoboticsMultimodal Learning