Beyond Ensembles: Simulating All-Atom Protein Dynamics in a Learned Latent Space

📅 2025-09-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Conventional enhanced sampling methods rely on predefined collective variables, hindering automatic identification and modeling of all-atom conformational evolution in proteins over long timescales. Method: We propose the first all-atom generative model embedded with a time propagator, featuring a unified encoder–propagator–decoder architecture in latent space. It systematically integrates autoregressive neural networks, score-guided Langevin dynamics, and Koopman linear operators within the LD-FPG framework. Contribution: This work achieves, for the first time, explicit time propagation of protein dynamics in latent space. We quantitatively characterize the trade-off among three propagator types in long-range stability and thermodynamic fidelity: autoregressive propagation yields the highest conformational stability; score-guided dynamics best reconstructs side-chain free-energy landscapes; and Koopman-based propagation provides an interpretable, lightweight baseline. The model accurately recovers free-energy landscapes and conformational distributions, establishing a new data-driven paradigm for protein dynamical modeling.

Technology Category

Application Category

📝 Abstract
Simulating the long-timescale dynamics of biomolecules is a central challenge in computational science. While enhanced sampling methods can accelerate these simulations, they rely on pre-defined collective variables that are often difficult to identify. A recent generative model, LD-FPG, demonstrated that this problem could be bypassed by learning to sample the static equilibrium ensemble as all-atom deformations from a reference structure, establishing a powerful method for all-atom ensemble generation. However, while this approach successfully captures a system's probable conformations, it does not model the temporal evolution between them. Here we extend LD-FPG with a temporal propagator that operates within the learned latent space and compare three classes: (i) score-guided Langevin dynamics, (ii) Koopman-based linear operators, and (iii) autoregressive neural networks. Within a unified encoder-propagator-decoder framework, we evaluate long-horizon stability, backbone and side-chain ensemble fidelity, and functional free-energy landscapes. Autoregressive neural networks deliver the most robust long rollouts; score-guided Langevin best recovers side-chain thermodynamics when the score is well learned; and Koopman provides an interpretable, lightweight baseline that tends to damp fluctuations. These results clarify the trade-offs among propagators and offer practical guidance for latent-space simulators of all-atom protein dynamics.
Problem

Research questions and friction points this paper is trying to address.

Simulating long-timescale all-atom protein dynamics efficiently
Overcoming reliance on pre-defined collective variables
Modeling temporal evolution between protein conformations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Latent-space temporal propagator for dynamics
Autoregressive neural networks enable robust rollouts
Score-guided Langevin recovers side-chain thermodynamics
A
Aditya Sengar
Signal Processing Laboratory (LTS2), EPFL, Lausanne, Switzerland
Ali Hariri
Ali Hariri
Senior Researcher at Huawei
Access ControlUsage ControlIoT SecurityNetwork SecurityData Spaces
Pierre Vandergheynst
Pierre Vandergheynst
Professor of Electrical Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL)
data sciencemachine learningartificial intelligencenetwork sciencecomputer vision
P
Patrick Barth
Institute of Bioengineering, EPFL, Lausanne, Switzerland; Ludwig Institute for Cancer Research, Lausanne, Switzerland