Incomplete Multi-view Clustering via Diffusion Contrastive Generation

📅 2025-03-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Incomplete multi-view clustering (IMVC), existing methods face two key bottlenecks under high missing rates (≥80%): heavy reliance on strong paired supervision and insufficient generation diversity and discriminability. Method: We propose an end-to-end diffusion-contrastive joint framework. It innovatively couples diffusion models with cross-view contrastive learning to enable view reconstruction under arbitrary missing patterns—without requiring paired samples. We further introduce dual-granularity (instance-level and class-level) interactive representation learning to jointly optimize generative fidelity and clustering performance. Forward noise perturbation and reverse denoising, multi-view consistency constraints, and unified optimization collectively enhance both the diversity and separability of generated views. Contribution/Results: Our method demonstrates robust clustering performance under extreme missingness (≥80%) across multiple benchmark datasets, consistently surpassing state-of-the-art approaches in clustering accuracy.

Technology Category

Application Category

📝 Abstract
Incomplete multi-view clustering (IMVC) has garnered increasing attention in recent years due to the common issue of missing data in multi-view datasets. The primary approach to address this challenge involves recovering the missing views before applying conventional multi-view clustering methods. Although imputation-based IMVC methods have achieved significant improvements, they still encounter notable limitations: 1) heavy reliance on paired data for training the data recovery module, which is impractical in real scenarios with high missing data rates; 2) the generated data often lacks diversity and discriminability, resulting in suboptimal clustering results. To address these shortcomings, we propose a novel IMVC method called Diffusion Contrastive Generation (DCG). Motivated by the consistency between the diffusion and clustering processes, DCG learns the distribution characteristics to enhance clustering by applying forward diffusion and reverse denoising processes to intra-view data. By performing contrastive learning on a limited set of paired multi-view samples, DCG can align the generated views with the real views, facilitating accurate recovery of views across arbitrary missing view scenarios. Additionally, DCG integrates instance-level and category-level interactive learning to exploit the consistent and complementary information available in multi-view data, achieving robust and end-to-end clustering. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches.
Problem

Research questions and friction points this paper is trying to address.

Addresses incomplete multi-view clustering with missing data
Improves data recovery and diversity in generated views
Enhances clustering accuracy through contrastive and interactive learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses diffusion and denoising for data recovery
Applies contrastive learning on paired samples
Integrates instance and category-level learning
🔎 Similar Papers
No similar papers found.
Yuanyang Zhang
Yuanyang Zhang
Ph.D Student, Southeast University
Multi-view LearningMulti-modal LearningAnomaly DetectionPerson Re-Identification
Y
Yijie Lin
College of Computer Science, Sichuan University
W
Weiqing Yan
School of Computer and Control Engineering, Yantai University
L
Li Yao
School of Computer Science and Engineering, Southeast University, Nanjing 210096, China; Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Xinhang Wan
Xinhang Wan
National University of Defense Technology
multi-view clusteringcontinual learningactive learning
Guangyuan Li
Guangyuan Li
Zhejiang University
Low-Level VisionMedical Image AnalysisVideo Generation
C
Chao Zhang
Department of Control Science and Intelligence Engineering, Nanjing University
Guanzhou Ke
Guanzhou Ke
Beijing Jiaotong University & Singapore Management University
visual understanding & generationmulti-modality learningmulti-view learning
J
Jie Xu
School of Computer Science and Engineering, University of Electronic Science and Technology of China