Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models

📅 2026-02-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the inefficient sampling of rare events—such as protein folding—in molecular dynamics, which severely hampers accurate computation of equilibrium thermodynamic quantities like free energy. The authors propose a diffusion model–based enhanced sampling framework that uniquely integrates concepts from umbrella sampling, free energy difference estimation, and metadynamics, yielding three novel algorithms: UmbrellaDiff, ΔG-Diff, and MetaDiff. These methods generate biased ensembles through controllable guidance and recover unbiased thermodynamic estimates via exact reweighting. Demonstrated on toy models, protein folding landscapes, and free energy calculations, the approach achieves high accuracy and efficiency, requiring only minutes to hours of GPU time to obtain equilibrium properties—thereby substantially narrowing the sampling gap for rare events.

Technology Category

Application Category

📝 Abstract
The rare-event sampling problem has long been the central limiting factor in molecular dynamics (MD), especially in biomolecular simulation. Recently, diffusion models such as BioEmu have emerged as powerful equilibrium samplers that generate independent samples from complex molecular distributions, eliminating the cost of sampling rare transition events. However, a sampling problem remains when computing observables that rely on states which are rare in equilibrium, for example folding free energies. Here, we introduce enhanced diffusion sampling, enabling efficient exploration of rare-event regions while preserving unbiased thermodynamic estimators. The key idea is to perform quantitatively accurate steering protocols to generate biased ensembles and subsequently recover equilibrium statistics via exact reweighting. We instantiate our framework in three algorithms: UmbrellaDiff (umbrella sampling with diffusion models), $Δ$G-Diff (free-energy differences via tilted ensembles), and MetaDiff (a batchwise analogue for metadynamics). Across toy systems, protein folding landscapes and folding free energies, our methods achieve fast, accurate, and scalable estimation of equilibrium properties within GPU-minutes to hours per system -- closing the rare-event sampling gap that remained after the advent of diffusion-model equilibrium samplers.
Problem

Research questions and friction points this paper is trying to address.

rare-event sampling
molecular dynamics
free energy calculation
equilibrium sampling
biomolecular simulation
Innovation

Methods, ideas, or system contributions that make the work stand out.

enhanced diffusion sampling
diffusion models
rare-event sampling
free energy calculation
exact reweighting
Y
Yu Xie
Microsoft Research AI for Science
Ludwig Winkler
Ludwig Winkler
Microsoft Research
Stochastic ProcessesProbabilistic MLMachine Learning
L
Lixin Sun
Microsoft Research AI for Science
S
Sarah Lewis
Microsoft Research AI for Science
A
Adam E. Foster
Microsoft Research AI for Science
J
José Jiménez Luna
Microsoft Research AI for Science
T
Tim Hempel
Microsoft Research AI for Science
Michael Gastegger
Michael Gastegger
Microsoft Research AI4Science
Machine LearningComputational Chemistry
Y
Yaoyi Chen
Microsoft Research AI for Science
I
Iryna Zaporozhets
Microsoft Research AI for Science
Cecilia Clementi
Cecilia Clementi
Professor of Physics, Freie Universität Berlin
Biophysics
Christopher M. Bishop
Christopher M. Bishop
Technical Fellow, Director of Microsoft Research AI for Science, Cambridge, U.K.
Machine learning
F
Frank Noé
Microsoft Research AI for Science