Time-Delayed Transformers for Data-Driven Modeling of Low-Dimensional Dynamics

📅 2026-02-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of effectively modeling nonlinear and even chaotic low-dimensional spatiotemporal dynamical systems while preserving the efficiency and interpretability of linear models. The authors propose the Time-Delay Transformer (TD-TF), an extremely minimalist single-layer, single-head Transformer architecture that extends Time-Delay Dynamic Mode Decomposition (TD-DMD) to the nonlinear regime for the first time. TD-TF comprises only one self-attention layer—with a single query per time step—and one feedforward layer, achieving linear sequence complexity and a low parameter count. Experiments on synthetic signals, unsteady aerodynamic flows, the Lorenz '63 system, and reaction–diffusion dynamics demonstrate that TD-TF matches classical methods in linear regimes and substantially outperforms linear baselines in nonlinear and chaotic settings, accurately capturing long-term dynamical behavior.

Technology Category

Application Category

📝 Abstract
We propose the time-delayed transformer (TD-TF), a simplified transformer architecture for data-driven modeling of unsteady spatio-temporal dynamics. TD-TF bridges linear operator-based methods and deep sequence models by showing that a single-layer, single-head transformer can be interpreted as a nonlinear generalization of time-delayed dynamic mode decomposition (TD-DMD). The architecture is deliberately minimal, consisting of one self-attention layer with a single query per prediction and one feedforward layer, resulting in linear computational complexity in sequence length and a small parameter count. Numerical experiments demonstrate that TD-TF matches the performance of strong linear baselines on near-linear systems, while significantly outperforming them in nonlinear and chaotic regimes, where it accurately captures long-term dynamics. Validation studies on synthetic signals, unsteady aerodynamics, the Lorenz'63 system, and a reaction-diffusion model show that TD-TF preserves the interpretability and efficiency of linear models while providing substantially enhanced expressive power for complex dynamics.
Problem

Research questions and friction points this paper is trying to address.

data-driven modeling
low-dimensional dynamics
nonlinear dynamics
chaotic systems
spatio-temporal dynamics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Time-Delayed Transformer
Dynamic Mode Decomposition
Data-Driven Modeling
Nonlinear Dynamics
Spatio-Temporal Modeling
🔎 Similar Papers
2023-12-17Bulletin of the American Mathematical SocietyCitations: 59