AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation

📅 2025-06-06

📈 Citations: 0

✨ Influential: 0

career value

182K/year

🤖 AI Summary

Virtual screening (VS) performance degrades significantly on apo or AlphaFold2-predicted protein structures due to the absence of accurate ligand-binding pocket annotations. Method: We propose the first alignment-aggregation framework explicitly designed for structural uncertainty: a trimodal contrastive learning scheme unifies representations of ligands, holographic pockets, and detected cavities; coupled with a cross-attention adapter that dynamically aggregates candidate binding sites—eliminating reliance on manual pocket annotation while preserving activity signal modeling. Contribution/Results: Evaluated on a newly constructed apo-structure benchmark, our method achieves an enrichment factor EF₁% of 37.19—substantially surpassing the state-of-the-art (11.75). It maintains competitive performance on holo structures, demonstrating strong generalizability and practical utility. This work establishes a novel paradigm for reliable VS in early drug discovery using low-confidence protein structures.

Technology Category

Application Category

📝 Abstract

Virtual screening (VS) is a critical component of modern drug discovery, yet most existing methods--whether physics-based or deep learning-based--are developed around holo protein structures with known ligand-bound pockets. Consequently, their performance degrades significantly on apo or predicted structures such as those from AlphaFold2, which are more representative of real-world early-stage drug discovery, where pocket information is often missing. In this paper, we introduce an alignment-and-aggregation framework to enable accurate virtual screening under structural uncertainty. Our method comprises two core components: (1) a tri-modal contrastive learning module that aligns representations of the ligand, the holo pocket, and cavities detected from structures, thereby enhancing robustness to pocket localization error; and (2) a cross-attention based adapter for dynamically aggregating candidate binding sites, enabling the model to learn from activity data even without precise pocket annotations. We evaluated our method on a newly curated benchmark of apo structures, where it significantly outperforms state-of-the-art methods in blind apo setting, improving the early enrichment factor (EF1%) from 11.75 to 37.19. Notably, it also maintains strong performance on holo structures. These results demonstrate the promise of our approach in advancing first-in-class drug discovery, particularly in scenarios lacking experimentally resolved protein-ligand complexes.

Problem

Research questions and friction points this paper is trying to address.

Improves virtual screening accuracy for uncertain protein structures

Aligns ligand, holo pocket, and cavity representations robustly

Enables learning without precise pocket annotations via dynamic aggregation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Tri-modal contrastive learning aligns ligand and pocket representations

Cross-attention adapter aggregates candidate binding sites dynamically

Robust virtual screening under structural uncertainty via alignment-aggregation

🔎 Similar Papers

PharmacoMatch: Efficient 3D Pharmacophore Screening through Neural Subgraph Matching