Co-Diffusion: An Affinity-Aware Two-Stage Latent Diffusion Framework for Generalizable Drug-Target Affinity Prediction

πŸ“… 2026-03-11
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenges of representation collapse, label scarcity, and domain shift in drug–target affinity (DTA) prediction under cold-start scenarios by proposing Co-Diffusion, a novel framework that introduces, for the first time, an affinity-aware latent diffusion mechanism into DTA modeling. The approach adopts a two-stage paradigm: it first constructs an affinity-guided latent manifold and then incorporates modality-specific latent diffusion as a regularizer to mitigate the conflict between generative and regression objectives. By integrating supervised embedding alignment, modality-specific perturbations, and variational inference optimization, Co-Diffusion substantially enhances zero-shot generalization on unseen molecular scaffolds and novel protein families. Extensive experiments demonstrate that the method consistently outperforms state-of-the-art approaches across multiple benchmarks, offering robust support for virtual screening applications.

Technology Category

Application Category

πŸ“ Abstract
Predicting drug-target affinity is fundamental to virtual screening and lead optimization. However, existing deep models often suffer from representation collapse in stringent cold-start regimes, where the scarcity of labels and domain shifts prevent the learning of transferable pharmacophores and binding motifs. In this paper, we propose Co-Diffusion, a novel affinity-aware framework that redefines DTA prediction as a constrained latent denoising process to enhance generalization. Co-Diffusion employs a two-stage paradigm: Stage I establishes an affinity-steered latent manifold by aligning drug and target embeddings under an explicit supervised objective, ensuring that the latent space reflects the intrinsic binding landscape. Stage II introduces modality-specific latent diffusion as a stochastic perturb-and-denoise regularizer, forcing the model to recover consistent affinity semantics from noisy structural representations. This approach effectively mitigates the reconstruction-regression conflict common in generative DTA models. Theoretically, we show that Co-Diffusion maximizes a variational lower bound on the joint likelihood of drug structures, protein sequences, and binding strength. Extensive experiments across multiple benchmarks demonstrate that Co-Diffusion significantly outperforms state-of-the-art baselines, particularly yielding superior zero-shot generalization on unseen molecular scaffolds and novel protein families-paving a robust path for in silico drug prioritization in unexplored chemical spaces.
Problem

Research questions and friction points this paper is trying to address.

drug-target affinity
cold-start
representation collapse
domain shift
generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

latent diffusion
drug-target affinity
two-stage framework
zero-shot generalization
affinity-aware modeling
Y
Yining Qian
School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China
P
Pengjie Wang
College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
Yixiao Li
Yixiao Li
Georgia Institute of Technology
Machine Learning
A
An-Yang Lu
College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
C
Cheng Tan
Westlake University, Hangzhou 310000, China
S
Shuang Li
School of Artificial Intelligence, Beihang University, Beijing 100000, China
L
Lijun Liu
Key Laboratory of Bioresource Research and Development of Liaoning Province, College of Life and Health Sciences, Northeastern University, Shenyang 110169, China