Wittgenstein's Family Resemblance Clustering Algorithm

📅 2026-01-03
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes WFR, a novel graph-based nonlinear clustering algorithm inspired by Wittgenstein’s notion of “family resemblance,” which addresses the limitations of traditional clustering methods that often require a predefined number of clusters or impose strong assumptions on cluster shape. WFR formalizes the philosophical concept of family resemblance by constructing a similarity graph, applying thresholding, and analyzing connected components to automatically discover clusters of arbitrary shape without prior knowledge of the cluster count. The method is further extended via kernelization to Kernel WFR to capture complex nonlinear relationships. Experimental results on standard benchmarks demonstrate that WFR effectively uncovers intricate semantic structures and exhibits strong robustness to unknown cluster numbers and irregular cluster geometries.

Technology Category

Application Category

📝 Abstract
This paper, introducing a novel method in philomatics, draws on Wittgenstein's concept of family resemblance from analytic philosophy to develop a clustering algorithm for machine learning. According to Wittgenstein's Philosophical Investigations (1953), family resemblance holds that members of a concept or category are connected by overlapping similarities rather than a single defining property. Consequently, a family of entities forms a chain of items sharing overlapping traits. This philosophical idea naturally lends itself to a graph-based approach in machine learning. Accordingly, we propose the Wittgenstein's Family Resemblance (WFR) clustering algorithm and its kernel variant, kernel WFR. This algorithm computes resemblance scores between neighboring data instances, and after thresholding these scores, a resemblance graph is constructed. The connected components of this graph define the resulting clusters. Simulations on benchmark datasets demonstrate that WFR is an effective nonlinear clustering algorithm that does not require prior knowledge of the number of clusters or assumptions about their shapes.
Problem

Research questions and friction points this paper is trying to address.

clustering
family resemblance
nonlinear clustering
cluster shape assumption
number of clusters
Innovation

Methods, ideas, or system contributions that make the work stand out.

family resemblance
graph-based clustering
nonlinear clustering
Wittgenstein
kernel method
🔎 Similar Papers
No similar papers found.
G
Golbahar Amanpour
Waterloo, Ontario, Canada
Benyamin Ghojogh
Benyamin Ghojogh
AI Scientist
Machine LearningDeep LearningTheory