Fitness aligned structural modeling enables scalable virtual screening with AuroBind

📅 2025-08-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Over 96% of human proteins lack approved therapeutic agents, primarily due to the limited atomic-level accuracy and inability of conventional structure-based virtual screening methods to model ligand–protein binding adaptability. To address this, we introduce AuroBind—a unified framework integrating direct preference optimization, self-distillation of high-confidence protein–ligand complexes, and teacher–student acceleration—enabling joint prediction of binding poses and conformational adaptability. Fine-tuned on a million-compound chemogenomic dataset, AuroBind achieves atomic-resolution structural fidelity and functional awareness, enabling ultra-large-scale library screening with >10,000× speedup. In prospective screening across ten disease-relevant targets, experimental hit rates ranged from 7% to 69%, yielding sub-nanomolar to picomolar compounds. Notably, AuroBind identified first-in-class agonists and antagonists for orphan receptors GPR151 and GPR160, with functional modulation validated in hepatocellular and prostate cancer models.

Technology Category

Application Category

📝 Abstract
Most human proteins remain undrugged, over 96% of human proteins remain unexploited by approved therapeutics. While structure-based virtual screening promises to expand the druggable proteome, existing methods lack atomic-level precision and fail to predict binding fitness, limiting translational impact. We present AuroBind, a scalable virtual screening framework that fine-tunes a custom atomic-level structural model on million-scale chemogenomic data. AuroBind integrates direct preference optimization, self-distillation from high-confidence complexes, and a teacher-student acceleration strategy to jointly predict ligand-bound structures and binding fitness. The proposed models outperform state-of-the-art models on structural and functional benchmarks while enabling 100,000-fold faster screening across ultra-large compound libraries. In a prospective screen across ten disease-relevant targets, AuroBind achieved experimental hit rates of 7-69%, with top compounds reaching sub-nanomolar to picomolar potency. For the orphan GPCRs GPR151 and GPR160, AuroBind identified both agonists and antagonists with success rates of 16-30%, and functional assays confirmed GPR160 modulation in liver and prostate cancer models. AuroBind offers a generalizable framework for structure-function learning and high-throughput molecular screening, bridging the gap between structure prediction and therapeutic discovery.
Problem

Research questions and friction points this paper is trying to address.

Addresses undrugged human proteins with scalable virtual screening
Improves atomic-level precision in predicting binding fitness
Enables rapid screening across ultra-large compound libraries
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fine-tunes atomic-level model on chemogenomic data
Integrates preference optimization and self-distillation
Enables ultra-fast screening with teacher-student strategy
Z
Zhongyue Zhang
Global Institute of Future Technology, Shanghai Jiao Tong University, Shanghai, China; Lingang Laboratory, Shanghai, China
Jiahua Rao
Jiahua Rao
Sun Yat-sen University
AI4ScienceMulti-scale Learning
Jie Zhong
Jie Zhong
Medicinal Chemistry and Bioinformatics Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
W
Weiqiang Bai
Research Institute of Intelligent Complex Systems, Fudan University, Shanghai, China; Shanghai Artificial Intelligence Laboratory, Shanghai, China
D
Dongxue Wang
Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Guangdong, China
S
Shaobo Ning
Medicinal Chemistry and Bioinformatics Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
L
Lifeng Qiao
Shanghai Artificial Intelligence Laboratory, Shanghai, China
S
Sheng Xu
Research Institute of Intelligent Complex Systems, Fudan University, Shanghai, China; Shanghai Artificial Intelligence Laboratory, Shanghai, China
R
Runze Ma
Global Institute of Future Technology, Shanghai Jiao Tong University, Shanghai, China; Lingang Laboratory, Shanghai, China
W
Will Hua
Lingang Laboratory, Shanghai, China
Jack Xiaoyu Chen
Jack Xiaoyu Chen
Massachusetts Institute of Technology
Synthetic biology
Odin Zhang
Odin Zhang
UW CS, Institute for Protein Design
AI4ScienceBiomolecule DesignDrug DesignComputer-aided Drug Design
W
Wei Lu
Global Institute of Future Technology, Shanghai Jiao Tong University, Shanghai, China
H
Hanyi Feng
Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Guangdong, China
He Yang
He Yang
Xi'an Jiaotong University
Federated LearningDeep LearningPrivacy & Security
X
Xinchao Shi
Medicinal Chemistry and Bioinformatics Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
R
Rui Li
Medicinal Chemistry and Bioinformatics Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
W
Wanli Ouyang
Shanghai Artificial Intelligence Laboratory, Shanghai, China
Xinzhu Ma
Xinzhu Ma
Associate Professor, Beihang University
deep learningcomputer vision3D scene understandingai4science
J
Jiahao Wang
Global Institute of Future Technology, Shanghai Jiao Tong University, Shanghai, China; Lingang Laboratory, Shanghai, China
J
Jixian Zhang
Global Institute of Future Technology, Shanghai Jiao Tong University, Shanghai, China
J
Jia Duan
Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Guangdong, China
S
Siqi Sun
Research Institute of Intelligent Complex Systems, Fudan University, Shanghai, China; Shanghai Artificial Intelligence Laboratory, Shanghai, China
J
Jian Zhang
Medicinal Chemistry and Bioinformatics Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of Protection, Development and Utilization of Medicinal Resources in Liupanshan Area, Ministry of Education, Peptide & Protein Drug Research Center, School of Pharmacy, Ningxia Medical University, Ningxia, China
Shuangjia Zheng
Shuangjia Zheng
Shanghai Jiao Tong University
Generative AIDrug DiscoverySynthetic BiologyMulti-Agent System