🤖 AI Summary
Accurately predicting the change in protein stability (ΔΔG) induced by single-point amino acid mutations is critical for elucidating disease mechanisms and guiding rational drug design; however, existing methods suffer from data sparsity and limited generalization capacity. To address these challenges, we propose ThermoMPNN+, a novel graph neural network architecture that integrates multi-source sequence and structural embeddings. Crucially, it introduces an implicit-space cross-modal distillation mechanism to enable synergistic modeling and knowledge transfer between sequence and 3D structural information. Evaluated on standard benchmarks—including S2648 and SKEMPI 2.0—ThermoMPNN+ achieves state-of-the-art performance, reducing mean absolute error by 18.7% compared to prior methods. The model significantly improves both predictive accuracy and interpretability. Its open-source implementation provides a robust, general-purpose tool for quantitative assessment of mutational effects.
📝 Abstract
Predicting the impact of single-point amino acid mutations on protein stability is essential for understanding disease mechanisms and advancing drug development. Protein stability, quantified by changes in Gibbs free energy ($DeltaDelta G$), is influenced by these mutations. However, the scarcity of data and the complexity of model interpretation pose challenges in accurately predicting stability changes. This study proposes the application of deep neural networks, leveraging transfer learning and fusing complementary information from different models, to create a feature-rich representation of the protein stability landscape. We developed four models, with our third model, ThermoMPNN+, demonstrating the best performance in predicting $DeltaDelta G$ values. This approach, which integrates diverse feature sets and embeddings through latent transfusion techniques, aims to refine $DeltaDelta G$ predictions and contribute to a deeper understanding of protein dynamics, potentially leading to advancements in disease research and drug discovery.