Zero-shot protein stability prediction by inverse folding models: a free energy interpretation

📅 2025-06-05

📈 Citations: 0

✨ Influential: 0

career value

214K/year

🤖 AI Summary

This study addresses the zero-shot prediction of protein thermodynamic stability. We reveal that the amino acid preferences of inverse folding models (e.g., ProteinMPNN) fundamentally reflect sequence contributions to folding free energy. Departing from prior work that approximates log-likelihood ratios as free energy differences heuristically, we derive—based on rigorous statistical mechanics—a physically grounded evaluation pathway centered on free energy differences, enabling physical consistency recalibration of model outputs. Through lightweight post-hoc correction alone, our method significantly improves zero-shot prediction performance across multiple benchmarks—including ThermoNet, DeepDDG, and ProTherm—achieving average Spearman correlation gains of 0.12–0.23. Crucially, it establishes, for the first time, an interpretable and quantifiable physical linkage between inverse folding model outputs and thermodynamic stability. This advances protein design in data-scarce regimes by introducing a new physics-informed paradigm.

Technology Category

Application Category

📝 Abstract

Inverse folding models have proven to be highly effective zero-shot predictors of protein stability. Despite this success, the link between the amino acid preferences of an inverse folding model and the free-energy considerations underlying thermodynamic stability remains incompletely understood. A better understanding would be of interest not only from a theoretical perspective, but also potentially provide the basis for stronger zero-shot stability prediction. In this paper, we take steps to clarify the free-energy foundations of inverse folding models. Our derivation reveals the standard practice of likelihood ratios as a simplistic approximation and suggests several paths towards better estimates of the relative stability. We empirically assess these approaches and demonstrate that considerable gains in zero-shot performance can be achieved with fairly simple means.

Problem

Research questions and friction points this paper is trying to address.

Understanding inverse folding models' link to protein free energy

Improving zero-shot protein stability prediction methods

Clarifying free-energy foundations of inverse folding models

Innovation

Methods, ideas, or system contributions that make the work stand out.

Inverse folding models predict protein stability

Likelihood ratios approximate free-energy foundations

Simple methods improve zero-shot performance

🔎 Similar Papers

AlphaFolding: 4D Diffusion for Dynamic Protein Structure Prediction with Reference and Motion Guidance