A Hybridizable Neural Time Integrator for Stable Autoregressive Forecasting

๐Ÿ“… 2026-04-22
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

225K/year
๐Ÿค– AI Summary
This work addresses the instability in training and inference inherent to long-term autoregressive modeling of chaotic dynamical systems by proposing a structure-preserving approach that integrates a shooting-based hybrid finite element framework with an autoregressive Transformer. The method leverages a Vision Transformer to extract structure-preserving latent dynamics and incorporates a provably stable hybrid neural time integrator, which enforces discrete energy conservation and guarantees uniformly bounded gradients, thereby effectively mitigating gradient explosion. Evaluated on a โ€œmini-foundationโ€ fusion component model trained on only 12 simulations, the proposed approach achieves a 9,000ร— speedup and surpasses existing foundation models with 65ร— fewer parameters, significantly enhancing both long-term prediction stability and computational efficiency.

Technology Category

Application Category

๐Ÿ“ Abstract
For autoregressive modeling of chaotic dynamical systems over long time horizons, the stability of both training and inference is a major challenge in building scientific foundation models. We present a hybrid technique in which an autoregressive transformer is embedded within a novel shooting-based mixed finite element scheme, exposing topological structure that enables provable stability. For forward problems, we prove preservation of discrete energies, while for training we prove uniform bounds on gradients, provably avoiding the exploding gradient problem. Combined with a vision transformer, this yields latent tokens admitting structure-preserving dynamics. We outperform modern foundation models with a $65\times$ reduction in model parameters and long-horizon forecasting of chaotic systems. A "mini-foundation" model of a fusion component shows that 12 simulations suffice to train a real-time surrogate, achieving a $9{,}000\times$ speedup over particle-in-cell simulation.
Problem

Research questions and friction points this paper is trying to address.

autoregressive forecasting
chaotic dynamical systems
stability
long time horizons
scientific foundation models
Innovation

Methods, ideas, or system contributions that make the work stand out.

hybridizable neural integrator
structure-preserving dynamics
autoregressive forecasting
finite element scheme
gradient stability
B
Brooks Kinch
Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, PA, USA
Xiaozhe Hu
Xiaozhe Hu
Tufts University
MultigridSpectral Graph TheoryFinite-element MethodPorous Media FlowBioinformatics
Y
Yilong Huang
Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, PA, USA
M
Martine Dyring Hansen
SINTEF Digital, Oslo, Norway
S
Sunniva Meltzer
SINTEF Digital, Oslo, Norway
N
Nathaniel Donald Hamlin
Sandia National Laboratories, Albuquerque, NM, USA
D
David Sirajuddin
Sandia National Laboratories, Albuquerque, NM, USA
Eric C. Cyr
Eric C. Cyr
Computational Mathematics Department, Sandia National Laboratories
Computational SciencePreconditioningNumerical PDEsScientific Machine Learning
Nathaniel Trask
Nathaniel Trask
University of Pennsylvania
Scientific machine learningphysics compatible discretizationmeshless methodsmultiphysics