Energy-Based Flow Matching for Generating 3D Molecular Structure

๐Ÿ“… 2025-08-26
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the fundamental problem of 3D molecular conformation generation in computational biology. We propose an energy-guided flow matching framework that models molecular conformations as distributions on an energy-driven manifold, learning a differentiable, iterative mapping from random initial configurations to target conformations via deep neural networks. Crucially, physics-inspired energy functions are explicitly incorporated during both training and inference to ensure idempotency and numerical stability; additionally, an AlphaFold-inspired structural refinement strategy is integrated to enhance geometric validity. Compared to state-of-the-art flow matching and diffusion-based methods, our approach achieves new SOTA performance on proteinโ€“ligand docking and protein backbone conformation generation tasks, while maintaining comparable computational cost.

Technology Category

Application Category

๐Ÿ“ Abstract
Molecular structure generation is a fundamental problem that involves determining the 3D positions of molecules' constituents. It has crucial biological applications, such as molecular docking, protein folding, and molecular design. Recent advances in generative modeling, such as diffusion models and flow matching, have made great progress on these tasks by modeling molecular conformations as a distribution. In this work, we focus on flow matching and adopt an energy-based perspective to improve training and inference of structure generation models. Our view results in a mapping function, represented by a deep network, that is directly learned to extit{iteratively} map random configurations, i.e. samples from the source distribution, to target structures, i.e. points in the data manifold. This yields a conceptually simple and empirically effective flow matching setup that is theoretically justified and has interesting connections to fundamental properties such as idempotency and stability, as well as the empirically useful techniques such as structure refinement in AlphaFold. Experiments on protein docking as well as protein backbone generation consistently demonstrate the method's effectiveness, where it outperforms recent baselines of task-associated flow matching and diffusion models, using a similar computational budget.
Problem

Research questions and friction points this paper is trying to address.

Generating 3D molecular structures from random configurations
Improving training and inference of flow matching models
Addressing molecular docking and protein backbone generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Energy-based flow matching for 3D molecules
Iterative deep network mapping random configurations
Theoretically justified with idempotency and stability
๐Ÿ”Ž Similar Papers
No similar papers found.
W
Wenyin Zhou
KTH Royal Institute of Technology, Stockholm, Sweden
Christopher Iliffe Sprague
Christopher Iliffe Sprague
The Alan Turing Institute
Artificial IntelligenceControl TheoryMachine LearningHybrid Systems
V
Vsevolod Viliuga
Science for Life Laboratory, Sweden; DBB at Stockholm University, Sweden; Max Planck Institute for Polymer Research, Mainz, Germany
M
Matteo Tadiello
Science for Life Laboratory, Sweden; DBB at Stockholm University, Sweden
Arne Elofsson
Arne Elofsson
Science for Life Laboratory and department of Biochemistry and Biophysics, Stockholm University
Protein structurefoldingevolution and interactionsmachine learnig
Hossein Azizpour
Hossein Azizpour
Associate Professor, KTH (Royal Institute of Technology)
Computer VisionDeep LearningLife SciencesSustainable Development