FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction

📅 2025-10-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing 3D ligand generation models suffer from geometric distortion, high strain energy, and a trade-off between generation quality and binding affinity prediction accuracy. To address these challenges, we propose FLOWR.root—the first foundational model integrating equivariance and flow matching to jointly perform pocket-conditioned 3D ligand generation and multi-endpoint affinity prediction (pIC50/pKi/pKd/pEC50). Our method introduces importance sampling for inference-time structural refinement, hybrid-fidelity training, parameter-efficient fine-tuning, and a quantum-mechanics-validated geometric constraint framework. FLOWR.root achieves state-of-the-art performance in both unconditional and pocket-conditioned generation tasks. On the SPINDR and Schrödinger FEP+/OpenFE benchmarks, it significantly outperforms prior methods in affinity prediction—delivering higher computational efficiency and stronger correlation with experimental measurements. The framework comprehensively supports key drug discovery workflows, including de novo design, pharmacophore-guided optimization, and fragment growth.

Technology Category

Application Category

📝 Abstract
We present Flowr.root, an equivariant flow-matching model for pocket-aware 3D ligand generation with joint binding affinity prediction and confidence estimation. The model supports de novo generation, pharmacophore-conditional sampling, fragment elaboration, and multi-endpoint affinity prediction (pIC50, pKi, pKd, pEC50). Training combines large-scale ligand libraries with mixed-fidelity protein-ligand complexes, followed by refinement on curated co-crystal datasets and parameter-efficient finetuning for project-specific adaptation. Flowr.root achieves state-of-the-art performance in unconditional 3D molecule generation and pocket-conditional ligand design, producing geometrically realistic, low-strain structures. The integrated affinity prediction module demonstrates superior accuracy on the SPINDR test set and outperforms recent models on the Schrodinger FEP+/OpenFE benchmark with substantial speed advantages. As a foundation model, Flowr.root requires finetuning on project-specific datasets to account for unseen structure-activity landscapes, yielding strong correlation with experimental data. Joint generation and affinity prediction enable inference-time scaling through importance sampling, steering molecular design toward higher-affinity compounds. Case studies validate this: selective CK2alpha ligand generation against CLK3 shows significant correlation between predicted and quantum-mechanical binding energies, while ERalpha and TYK2 scaffold elaboration demonstrates strong agreement with QM calculations. By integrating structure-aware generation, affinity estimation, and property-guided sampling, Flowr.root provides a comprehensive foundation for structure-based drug design spanning hit identification through lead optimization.
Problem

Research questions and friction points this paper is trying to address.

Develops joint 3D ligand generation and affinity prediction model
Enables structure-aware drug design from hit identification to optimization
Integrates property-guided sampling for higher-affinity compound generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Equivariant flow-matching model for pocket-aware 3D ligand generation
Joint binding affinity prediction with confidence estimation
Parameter-efficient finetuning for project-specific adaptation
🔎 Similar Papers
No similar papers found.
Julian Cremer
Julian Cremer
Unknown affiliation
T
Tuan Le
Machine Learning & Computational Sciences, Pfizer Worldwide R&D, Berlin, Germany
M
Mohammad M. Ghahremanpour
Computational Chemistry, Medicine Design, Pfizer Worldwide R&D, Cambridge, USA
E
Emilia Sługocka
Doctoral School of Medical and Health Sciences, Jagiellonian University Medical College, Cracow, Poland; Department of Physicochemical Drug Analysis, Faculty of Pharmacy, Jagiellonian University Medical College, Cracow, Poland
F
Filipe Menezes
Institute of Structural Biology, Molecular Targets and Therapeutics Center, Helmholtz Munich, Neuherberg, Germany; TUM School of Natural Sciences, Department of Bioscience, Bayerisches NMR Zentrum, Technical University of Munich, Garching, Germany
Djork-Arné Clevert
Djork-Arné Clevert
Pfizer, VP, Machine Learning Research
Drug DiscoveryMachine LearningDeep LearningComputational ChemistryComputational Biology