MIHT: A Hoeffding Tree for Time Series Classification using Multiple Instance Learning

📅 2026-03-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of modeling and interpretation in multivariate, variable-length time series classification, where high dimensionality and heterogeneous sequence lengths complicate learning. The authors propose a multi-instance learning approach based on a “bag-of-subsequences” representation, which—by integrating Hoeffding trees with multi-instance learning for the first time—yields an incremental, white-box decision model. This method effectively discriminates between informative temporal segments and noise, automatically identifying salient variables and critical time intervals while maintaining model compactness and enhancing interpretability. Experimental evaluation on 28 public datasets demonstrates that the proposed approach outperforms 11 state-of-the-art algorithms in overall accuracy, with particularly strong performance in high-dimensional settings.

Technology Category

Application Category

📝 Abstract
Due to the prevalence of temporal data and its inherent dependencies in many real-world problems, time series classification is of paramount importance in various domains. However, existing models often struggle with series of variable length or high dimensionality. This paper introduces the MIHT (Multi-instance Hoeffding Tree) algorithm, an efficient model that uses multi-instance learning to classify multivariate and variable-length time series while providing interpretable results. The algorithm uses a novel representation of time series as "bags of subseries," together with an optimization process based on incremental decision trees that distinguish relevant parts of the series from noise. This methodology extracts the underlying concept of series with multiple variables and variable lengths. The generated decision tree is a compact, white-box representation of the series' concept, providing interpretability insights into the most relevant variables and segments of the series. Experimental results demonstrate MIHT's superiority, as it outperforms 11 state-of-the-art time series classification models on 28 public datasets, including high-dimensional ones. MIHT offers enhanced accuracy and interpretability, making it a promising solution for handling complex, dynamic time series data.
Problem

Research questions and friction points this paper is trying to address.

time series classification
variable-length
high dimensionality
multivariate time series
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multiple Instance Learning
Hoeffding Tree
Time Series Classification
Interpretable Model
Variable-length Time Series
🔎 Similar Papers
No similar papers found.
A
Aurora Esteban
Dept. of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence, University of Cordoba, Spain
Amelia Zafra
Amelia Zafra
Associate Professor of Computer Science, University of Cordoba
Artificial IntelligenceMachine LearningData MiningEvolutionary Computation
S
Sebastián Ventura
Dept. of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence, University of Cordoba, Spain