From STLS to Projection-based Dictionary Selection in Sparse Regression for System Identification

📅 2025-12-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In SINDy-type sparse system identification, dictionary selection lacks theoretical grounding, compromising model accuracy and interpretability. Method: This paper proposes an adaptive dictionary pruning strategy based on projection error scores. It establishes, for the first time, a unified theoretical framework for score-driven dictionary selection, jointly modeling reconstruction error and dictionary mutual coherence. The framework integrates sequential thresholded least squares (STLS), ℓ₀ sparse optimization, and proximal gradient methods directly into the SINDy pipeline. Contributions/Results: (1) It enables interpretable, data-adaptive sparse regression; (2) it significantly improves model accuracy and physical consistency in both ordinary and partial differential equation system identification tasks; and (3) it enhances robustness to noise while providing reusable, principled criteria for dictionary refinement.

Technology Category

Application Category

📝 Abstract
In this work, we revisit dictionary-based sparse regression, in particular, Sequential Threshold Least Squares (STLS), and propose a score-guided library selection to provide practical guidance for data-driven modeling, with emphasis on SINDy-type algorithms. STLS is an algorithm to solve the $ell_0$ sparse least-squares problem, which relies on splitting to efficiently solve the least-squares portion while handling the sparse term via proximal methods. It produces coefficient vectors whose components depend on both the projected reconstruction errors, here referred to as the scores, and the mutual coherence of dictionary terms. The first contribution of this work is a theoretical analysis of the score and dictionary-selection strategy. This could be understood in both the original and weak SINDy regime. Second, numerical experiments on ordinary and partial differential equations highlight the effectiveness of score-based screening, improving both accuracy and interpretability in dynamical system identification. These results suggest that integrating score-guided methods to refine the dictionary more accurately may help SINDy users in some cases to enhance their robustness for data-driven discovery of governing equations.
Problem

Research questions and friction points this paper is trying to address.

Improves dictionary selection in sparse regression for system identification
Enhances accuracy and interpretability in dynamical system identification
Integrates score-guided methods to refine SINDy algorithms for robustness
Innovation

Methods, ideas, or system contributions that make the work stand out.

Score-guided dictionary selection for sparse regression
Theoretical analysis of score and dictionary selection strategy
Numerical experiments on differential equations for validation
🔎 Similar Papers
No similar papers found.
H
Hangjun Cho
AI Institute in Dynamic Systems, Department of Mechanical Engineering, University of Washington, Seattle, WA 98195, United States
F
Fabio V. G. Amaral
Departamento de Matemática e Computação, Faculdade de Ciências e Tecnologia, Universidade Estadual Paulista “Júlio de Mesquita Filho”, Presidente Prudente, Brazil
A
Andrei A. Klishin
Department of Mechanical Engineering, University of Hawai‘i at Mānoa, Honolulu 96822, United States
Cassio M. Oishi
Cassio M. Oishi
Associate Professor, Sao Paulo State University, UNESP, Brazil
computational fluid dynamics and numerical solution of PDE
Steven L. Brunton
Steven L. Brunton
Professor, University of Washington
Dynamical systemsControlFluid dynamicsMachine learningModel Reduction