The Expressive Limits of Diagonal SSMs for State-Tracking

πŸ“… 2026-03-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study investigates the expressive limits of diagonal state space models (SSMs) in sequential state tracking tasks, with a focus on their inability to model states governed by non-abelian groups. By integrating group theory and representation theory, the work precisely characterizes the class of state tracking tasks realizable by a k-layer complex-valued diagonal SSM: namely, those corresponding to solvable groups admitting a subnormal series of abelian factors of length at most k. Theoretically, it is proven that a single-layer SSM cannot track arbitrary non-abelian group states under finite precision. Although deeper architectures theoretically extend expressivity, empirical results demonstrate persistent practical learning limitations. This work establishes a fundamental connection between SSM expressiveness and the algebraic structure of underlying state spaces, revealing an intrinsic architectural constraint in current SSM designs.

Technology Category

Application Category

πŸ“ Abstract
State-Space Models (SSMs) have recently been shown to achieve strong empirical performance on a variety of long-range sequence modeling tasks while remaining efficient and highly-parallelizable. However, the theoretical understanding of their expressive power remains limited. In this work, we study the expressivity of input-Dependent Complex-valued Diagonal (DCD) SSMs on sequential state-tracking tasks. We show that single-layer DCD SSMs cannot express state-tracking of any non-Abelian group at finite precision. More generally, we show that $k$-layer DCD SSMs can express state-tracking of a group if and only if that group has a subnormal series of length $k$, with Abelian factors. That is, we identify the precise expressivity range of $k$-layer DCD SSMs within the solvable groups. Empirically, we find that multi-layer models often fail to learn state-tracking for non-Abelian groups, highlighting a gap between expressivity and learnability.
Problem

Research questions and friction points this paper is trying to address.

State-Space Models
Expressivity
State-Tracking
Non-Abelian Groups
Diagonal SSMs
Innovation

Methods, ideas, or system contributions that make the work stand out.

State-Space Models
Expressivity
Non-Abelian Groups
Solvable Groups
Diagonal SSMs
πŸ”Ž Similar Papers
2024-05-27Neural Information Processing SystemsCitations: 24
2024-04-12International Conference on Machine LearningCitations: 35