🤖 AI Summary
Classification of vector-valued functional data with scalar covariates is challenging under irregular sampling, as conventional semiparametric methods rely on restrictive linear assumptions and fixed basis expansions, leading to substantial performance degradation.
Method: We propose Path Signature Logistic Regression (PSLR), a geometrically principled approach grounded in rough path theory. PSLR employs truncated path signatures to construct finite-dimensional, basis-free representations that inherently capture temporal nonlinearities and inter-channel dependencies, while remaining robust to irregular sampling. It integrates time-augmented path embeddings with semiparametric logistic regression to ensure interpretability.
Contribution/Results: We theoretically establish the existence of an optimal truncation level and prove consistency of the estimator. Empirical evaluations on synthetic and real-world datasets demonstrate that PSLR significantly outperforms state-of-the-art functional classification methods—particularly under non-uniform sampling—achieving superior accuracy, robustness, and interpretability.
📝 Abstract
We propose Path Signatures Logistic Regression (PSLR), a semi-parametric framework for classifying vector-valued functional data with scalar covariates. Classical functional logistic regression models rely on linear assumptions and fixed basis expansions, which limit flexibility and degrade performance under irregular sampling. PSLR overcomes these issues by leveraging truncated path signatures to construct a finite-dimensional, basis-free representation that captures nonlinear and cross-channel dependencies. By embedding trajectories as time-augmented paths, PSLR extracts stable, geometry-aware features that are robust to sampling irregularity without requiring a common time grid, while still preserving subject-specific timing patterns. We establish theoretical guarantees for the existence and consistent estimation of the optimal truncation order, along with non-asymptotic risk bounds. Experiments on synthetic and real-world datasets show that PSLR outperforms traditional functional classifiers in accuracy, robustness, and interpretability, particularly under non-uniform sampling schemes. Our results highlight the practical and theoretical benefits of integrating rough path theory into modern functional data analysis.