The Expressive Limits of Diagonal SSMs for State-Tracking

📅 2026-03-02

📈 Citations: 0

✨ Influential: 0

career value

200K/year

🤖 AI Summary

This study investigates the expressive limits of diagonal state space models (SSMs) in sequential state tracking tasks, with a focus on their inability to model states governed by non-abelian groups. By integrating group theory and representation theory, the work precisely characterizes the class of state tracking tasks realizable by a k-layer complex-valued diagonal SSM: namely, those corresponding to solvable groups admitting a subnormal series of abelian factors of length at most k. Theoretically, it is proven that a single-layer SSM cannot track arbitrary non-abelian group states under finite precision. Although deeper architectures theoretically extend expressivity, empirical results demonstrate persistent practical learning limitations. This work establishes a fundamental connection between SSM expressiveness and the algebraic structure of underlying state spaces, revealing an intrinsic architectural constraint in current SSM designs.

Technology Category

Application Category

📝 Abstract

State-Space Models (SSMs) have recently been shown to achieve strong empirical performance on a variety of long-range sequence modeling tasks while remaining efficient and highly-parallelizable. However, the theoretical understanding of their expressive power remains limited. In this work, we study the expressivity of input-Dependent Complex-valued Diagonal (DCD) SSMs on sequential state-tracking tasks. We show that single-layer DCD SSMs cannot express state-tracking of any non-Abelian group at finite precision. More generally, we show that $k$-layer DCD SSMs can express state-tracking of a group if and only if that group has a subnormal series of length $k$, with Abelian factors. That is, we identify the precise expressivity range of $k$-layer DCD SSMs within the solvable groups. Empirically, we find that multi-layer models often fail to learn state-tracking for non-Abelian groups, highlighting a gap between expressivity and learnability.

Problem

Research questions and friction points this paper is trying to address.

State-Space Models

Expressivity

State-Tracking

Non-Abelian Groups

Diagonal SSMs

Innovation

Methods, ideas, or system contributions that make the work stand out.

State-Space Models

Expressivity

Non-Abelian Groups