Bootstrap Deep Spectral Clustering with Optimal Transport

📅 2025-08-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Spectral clustering suffers from two fundamental bottlenecks: suboptimal graph cut optimization and limited representational capacity. To address these, we propose BootSC—the first end-to-end deep spectral clustering framework that jointly models similarity learning, spectral embedding, and cluster assignment. Our key contributions are: (1) leveraging optimal transport to generate self-guided supervisory signals, enabling progressive optimization in a fully unsupervised setting; and (2) introducing a semantically consistent, orthogonal reparameterized embedding layer that enhances feature discriminability and structural interpretability. Evaluated on challenging benchmarks including ImageNet-Dogs, BootSC achieves state-of-the-art performance, outperforming the second-best method by 16% in Normalized Mutual Information (NMI). This demonstrates substantial improvements in both end-to-end modeling fidelity and practical clustering effectiveness for spectral clustering.

Technology Category

Application Category

📝 Abstract
Spectral clustering is a leading clustering method. Two of its major shortcomings are the disjoint optimization process and the limited representation capacity. To address these issues, we propose a deep spectral clustering model (named BootSC), which jointly learns all stages of spectral clustering -- affinity matrix construction, spectral embedding, and $k$-means clustering -- using a single network in an end-to-end manner. BootSC leverages effective and efficient optimal-transport-derived supervision to bootstrap the affinity matrix and the cluster assignment matrix. Moreover, a semantically-consistent orthogonal re-parameterization technique is introduced to orthogonalize spectral embeddings, significantly enhancing the discrimination capability. Experimental results indicate that BootSC achieves state-of-the-art clustering performance. For example, it accomplishes a notable 16% NMI improvement over the runner-up method on the challenging ImageNet-Dogs dataset. Our code is available at https://github.com/spdj2271/BootSC.
Problem

Research questions and friction points this paper is trying to address.

Jointly optimizes spectral clustering stages end-to-end
Improves affinity and cluster assignment via optimal transport
Enhances spectral embedding discrimination with orthogonal re-parameterization
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end deep spectral clustering network
Optimal-transport-derived supervision for bootstrapping
Orthogonal re-parameterization for spectral embeddings
🔎 Similar Papers
W
Wengang Guo
College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China
W
Wei Ye
College of Electronic and Information Engineering, Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai 201804, China and Shanghai Innovation Institute, Shanghai 200231, China
C
Chunchun Chen
Shanghai Research Institute for Intelligent Autonomous Systems, Tongji University, Shanghai 201210, China
X
Xin Sun
Faculty of Data Science, City University of Macau, Taipa, Macau, China
Christian Böhm
Christian Böhm
Professor of Computer Science, University of Vienna, Vienna, Austria
knowledge discovery in databasesdata mininghigh-dimensional dataclusteringhigh-performance data mining
Claudia Plant
Claudia Plant
Professor of Computer Science, leader of research group Data Mining and Machine Learning, UniVie
Knowledge Discovery in DatabasesData MiningClusteringHigh-performance Data Mining
S
Susanto Rahardja
Singapore Institute of Technology, Singapore 138683, Singapore