🤖 AI Summary
This work proposes WFR, a novel graph-based nonlinear clustering algorithm inspired by Wittgenstein’s notion of “family resemblance,” which addresses the limitations of traditional clustering methods that often require a predefined number of clusters or impose strong assumptions on cluster shape. WFR formalizes the philosophical concept of family resemblance by constructing a similarity graph, applying thresholding, and analyzing connected components to automatically discover clusters of arbitrary shape without prior knowledge of the cluster count. The method is further extended via kernelization to Kernel WFR to capture complex nonlinear relationships. Experimental results on standard benchmarks demonstrate that WFR effectively uncovers intricate semantic structures and exhibits strong robustness to unknown cluster numbers and irregular cluster geometries.
📝 Abstract
This paper, introducing a novel method in philomatics, draws on Wittgenstein's concept of family resemblance from analytic philosophy to develop a clustering algorithm for machine learning. According to Wittgenstein's Philosophical Investigations (1953), family resemblance holds that members of a concept or category are connected by overlapping similarities rather than a single defining property. Consequently, a family of entities forms a chain of items sharing overlapping traits. This philosophical idea naturally lends itself to a graph-based approach in machine learning. Accordingly, we propose the Wittgenstein's Family Resemblance (WFR) clustering algorithm and its kernel variant, kernel WFR. This algorithm computes resemblance scores between neighboring data instances, and after thresholding these scores, a resemblance graph is constructed. The connected components of this graph define the resulting clusters. Simulations on benchmark datasets demonstrate that WFR is an effective nonlinear clustering algorithm that does not require prior knowledge of the number of clusters or assumptions about their shapes.