🤖 AI Summary
Functional data are inherently infinite-dimensional and highly susceptible to outliers, rendering conventional clustering methods insufficiently robust. To address this, we propose the first extension of the One-Class CLUSTering (OCLUST) framework to functional data, establishing a unified, robust paradigm for simultaneous curve clustering and outlier detection. Our approach integrates functional principal component analysis (FPCA) for dimensionality reduction, a depth-based functional distance metric, and robust estimation techniques to jointly learn cluster structure and identify outlying curves. Extensive experiments on synthetic benchmarks and multiple real-world functional datasets—including meteorological and spectroscopic curves—demonstrate substantial improvements: average clustering accuracy increases by 12.3%, and outlier detection F1-score improves by 18.7%. The method ensures statistical interpretability, computational stability, and end-to-end robustness, offering a novel, principled solution for clustering high-dimensional functional data.
📝 Abstract
Functional data present unique challenges for clustering due to their infinite-dimensional nature and potential sensitivity to outliers. An extension of the OCLUST algorithm to the functional setting is proposed to address these issues. The approach leverages the OCLUST framework, creating a robust method to cluster curves and trim outliers. The methodology is evaluated on both simulated and real-world functional datasets, demonstrating strong performance in clustering and outlier identification.