🤖 AI Summary
Existing topological clustering methods struggle to effectively model higher-order topological evolution when confronted with complex, interwoven geometric structures—such as nested loops and multiscale connectivity—leading to unstable performance. To address this, we propose an unsupervised clustering framework grounded in the Vietoris–Rips complex and Betti sequences: for the first time, we treat the sequence of Betti numbers across filtration scales as a learnable topological feature; integrate Betti-number-based filtering to dynamically identify topologically similar neighborhoods; and employ a graph-driven mechanism to refine cluster assignments. Our approach overcomes the adaptability bottleneck of conventional persistent homology–based clustering on highly entangled data. Extensive experiments on multiple synthetic and real-world benchmarks demonstrate significant improvements over state-of-the-art methods, achieving an average 12.6% gain in clustering accuracy. Notably, the method exhibits superior robustness and expressive power in discriminating non-Euclidean structural patterns.
📝 Abstract
Clustering aims to form groups of similar data points in an unsupervised regime. Yet, clustering complex datasets containing critically intertwined shapes poses significant challenges. The prevailing clustering algorithms widely depend on evaluating similarity measures based on Euclidean metrics. Exploring topological characteristics to perform clustering of complex datasets inevitably presents a better scope. The topological clustering algorithms predominantly perceive the point set through the lens of Simplicial complexes and Persistent homology. Despite these approaches, the existing topological clustering algorithms cannot somehow fully exploit topological structures and show inconsistent performances on some highly complicated datasets. This work aims to mitigate the limitations by identifying topologically similar neighbors through the Vietoris-Rips complex and Betti number filtration. In addition, we introduce the concept of the Betti sequences to capture flexibly essential features from the topological structures. Our proposed algorithm is adept at clustering complex, intertwined shapes contained in the datasets. We carried out experiments on several synthetic and real-world datasets. Our algorithm demonstrated commendable performances across the datasets compared to some of the well-known topology-based clustering algorithms.