Online Functional Principal Component Analysis on a Multidimensional Domain

๐Ÿ“… 2025-05-04
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Online functional principal component analysis (FPCA) for high-dimensional functional data streams remains challenging due to computational intractability and lack of scalable, theoretically grounded frameworks on non-Euclidean domains. Method: This paper introduces the first scalable online FPCA framework operating directly on the Stiefel manifold. It combines tensor-product spline bases with penalty-based smoothness constraints, designs a Riemannian stochastic gradient descent algorithm, and incorporates adaptive momentum estimation and iterate averaging. A novel dynamic hyperparameter tuning strategy based on rolling-block validation is further proposed. Contribution/Results: Experiments on synthetic and real-world spatiotemporal datasets demonstrate that the method achieves 10โ€“100ร— speedup over state-of-the-art baselines while improving reconstruction accuracy by 35%โ€“52% (mean error reduction). It enables real-time dimensionality reduction and feature extraction, establishing a new paradigm for large-scale functional data stream analysisโ€”rigorous in theory and viable in practice.

Technology Category

Application Category

๐Ÿ“ Abstract
Multidimensional functional data streams arise in diverse scientific fields, yet their analysis poses significant challenges. We propose a novel online framework for functional principal component analysis that enables efficient and scalable modeling of such data. Our method represents functional principal components using tensor product splines, enforcing smoothness and orthonormality through a penalized framework on a Stiefel manifold. An efficient Riemannian stochastic gradient descent algorithm is developed, with extensions inspired by adaptive moment estimation and averaging techniques to accelerate convergence. Additionally, a dynamic tuning strategy for smoothing parameter selection is developed based on a rolling averaged block validation score that adapts to the streaming nature of the data. Extensive simulations and real-world applications demonstrate the flexibility and effectiveness of this framework for analyzing multidimensional functional data.
Problem

Research questions and friction points this paper is trying to address.

Online analysis of multidimensional functional data streams
Efficient modeling using tensor product splines and Stiefel manifold
Dynamic smoothing parameter selection for streaming data adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Online FPCA using tensor product splines
Riemannian SGD with adaptive acceleration
Dynamic smoothing parameter tuning strategy
M
Muye Nanshan
Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, Canada; School of Data Science, Fudan University, Shanghai, China
N
Nan Zhang
School of Data Science, Fudan University, Shanghai, China; Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
Jiguo Cao
Jiguo Cao
Simon Fraser University
Functional Data AnalysisEstimating Differential EquationsMachine Learning