🤖 AI Summary
High-dimensional unlabeled motion data pose significant challenges for effective modeling and interpretation using conventional exploratory data analysis (EDA) techniques. Method: This paper proposes an interpretable EDA framework integrating a multi-level motion variable taxonomy with compositional anomaly detection. Contribution/Results: First, it introduces a novel hierarchical taxonomy of motion features, enabling structured modeling from macro-level behavioral categories to micro-level dynamical patterns. Second, it pioneers the coupling of hierarchical classification with node-level anomaly detection, supporting two-stage behavioral clustering—initially by kinematic, geometric, or hybrid criteria, then refined by velocity, acceleration, or indentation features. Evaluated on four heterogeneous motion datasets, the framework meets predefined validity criteria on three, demonstrating substantial improvements in automated hierarchical pattern recognition and semantic interpretability of motion behaviors.
📝 Abstract
Movement data is prevalent across various applications and scientific fields, often characterized by its massive scale and complexity. Exploratory Data Analysis (EDA) plays a crucial role in summarizing and describing such data, enabling researchers to generate insights and support scientific hypotheses. Despite its importance, traditional EDA practices face limitations when applied to high-dimensional, unlabeled movement data. The complexity and multi-faceted nature of this type of data require more advanced methods that go beyond the capabilities of current EDA techniques. This study addresses the gap in current EDA practices by proposing a novel approach that leverages movement variable taxonomies and outlier detection. We hypothesize that organizing movement features into a taxonomy, and applying anomaly detection to combinations of taxonomic nodes, can reveal meaningful patterns and lead to more interpretable descriptions of the data. To test this hypothesis, we introduce TUMD, a new method that integrates movement taxonomies with outlier detection to enhance data analysis and interpretation. TUMD was evaluated across four diverse datasets of moving objects using fixed parameter values. Its effectiveness was assessed through two passes: the first pass categorized the majority of movement patterns as Kinematic, Geometric, or Hybrid for all datasets, while the second pass refined these behaviors into more specific categories such as Speed, Acceleration, or Indentation. TUMD met the effectiveness criteria in three datasets, demonstrating its ability to describe and refine movement behaviors. The results confirmed our hypothesis, showing that the combination of movement taxonomies and anomaly detection successfully uncovers meaningful and interpretable patterns within high-dimensional, unlabeled movement data.