Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design

๐Ÿ“… 2026-01-14
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work proposes ConGLUDe, a novel framework that unifies protein structure and ligand information within a single model through contrastive geometric learning, overcoming the limitations of traditional drug design approaches that treat these data sources in isolation and rely on predefined binding pockets. By jointly training on proteinโ€“ligand complexes and large-scale bioactivity data, ConGLUDe employs a geometric protein encoder to generate embeddings for full-length proteins and implicit binding sites, coupled with an efficient ligand encoder. This enables zero-shot virtual screening, target identification, and ligand-guided binding site prediction without requiring pre-specified pockets. Experimental results demonstrate that ConGLUDe significantly outperforms existing methods in zero-shot virtual screening and target identification tasks, while accurately predicting ligand-conditioned binding pockets.

Technology Category

Application Category

๐Ÿ“ Abstract
Structure-based and ligand-based computational drug design have traditionally relied on disjoint data sources and modeling assumptions, limiting their joint use at scale. In this work, we introduce Contrastive Geometric Learning for Unified Computational Drug Design (ConGLUDe), a single contrastive geometric model that unifies structure- and ligand-based training. ConGLUDe couples a geometric protein encoder that produces whole-protein representations and implicit embeddings of predicted binding sites with a fast ligand encoder, removing the need for pre-defined pockets. By aligning ligands with both global protein representations and multiple candidate binding sites through contrastive learning, ConGLUDe supports ligand-conditioned pocket prediction in addition to virtual screening and target fishing, while being trained jointly on protein-ligand complexes and large-scale bioactivity data. Across diverse benchmarks, ConGLUDe achieves competitive zero-shot virtual screening performance, substantially outperforms existing methods on a challenging target fishing task, and demonstrates state-of-the-art ligand-conditioned pocket selection. These results highlight the advantages of unified structure-ligand training and position ConGLUDe as a step toward general-purpose foundation models for drug discovery.
Problem

Research questions and friction points this paper is trying to address.

structure-based drug design
ligand-based drug design
unified drug design
binding site prediction
computational drug discovery
Innovation

Methods, ideas, or system contributions that make the work stand out.

contrastive learning
geometric deep learning
unified drug design
binding site prediction
zero-shot virtual screening
๐Ÿ”Ž Similar Papers
No similar papers found.
L
Lisa Schneckenreiter
ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Austria
S
Sohvi Luukkonen
ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Austria
Lukas Friedrich
Lukas Friedrich
Principal Scientist Computational Chemistry & Biologics, Merck Healthcare, Darmstadt, Germany
Artificial IntelligenceMachine LearningDrug DiscoveryDe novo Design
Daniel Kuhn
Daniel Kuhn
Computational chemist @ Merck
G
G. Klambauer
ELLIS Unit Linz, LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Austria; Clinical Research Institute for Medical Artificial Intelligence, Johannes Kepler University, Linz, Austria