đ€ AI Summary
Proteinâligand binding affinity prediction faces a dual bottleneck: high-accuracy physical methods (e.g., alchemical free-energy perturbation, AB-FEP) incur prohibitive computational cost, while machine learning (ML) models are limited by scarcity of high-quality experimental or computed labels. To address this, we introduce ToxBenchâthe first large-scale, AB-FEPâderived benchmark dataset specifically for human estrogen receptor α (ERα), comprising 8,770 proteinâligand complexes and employing ligand-wise non-overlapping splits to ensure rigorous generalization evaluation. We further propose DualBind, a novel ML framework integrating structural inputs with physics-informed consistency constraints via a dual-loss objective. On ToxBench, DualBind achieves a root-mean-square error (RMSE) of 1.75 kcal/molâmatching AB-FEP accuracy at negligible computational cost. This work establishes a new paradigm for high-throughput, accurate binding affinity prediction in toxicity assessment and drug discovery.
đ Abstract
Protein-ligand binding affinity prediction is essential for drug discovery and toxicity assessment. While machine learning (ML) promises fast and accurate predictions, its progress is constrained by the availability of reliable data. In contrast, physics-based methods such as absolute binding free energy perturbation (AB-FEP) deliver high accuracy but are computationally prohibitive for high-throughput applications. To bridge this gap, we introduce ToxBench, the first large-scale AB-FEP dataset designed for ML development and focused on a single pharmaceutically critical target, Human Estrogen Receptor Alpha (ER$α$). ToxBench contains 8,770 ER$α$-ligand complex structures with binding free energies computed via AB-FEP with a subset validated against experimental affinities at 1.75 kcal/mol RMSE, along with non-overlapping ligand splits to assess model generalizability. Using ToxBench, we further benchmark state-of-the-art ML methods, and notably, our proposed DualBind model, which employs a dual-loss framework to effectively learn the binding energy function. The benchmark results demonstrate the superior performance of DualBind and the potential of ML to approximate AB-FEP at a fraction of the computational cost.