Graph Diffusion Network for Drug-Gene Prediction

📅 2025-02-13
📈 Citations: 0
Influential: 0
📄 PDF

career value

194K/year
🤖 AI Summary
Drug–gene association prediction faces dual challenges of data sparsity and low efficiency in contrastive learning. To address these, we propose an end-to-end graph diffusion framework that introduces, for the first time, a meta-path-driven homogeneous graph learning mechanism—transforming the heterogeneous drug–gene–disease network into a semantically consistent homogeneous graph. We further design a parallel diffusion network to dynamically generate high-quality hard negative samples, eliminating inefficient exhaustive negative sampling. Our method integrates graph neural networks, self-supervised contrastive learning, and differentiable graph diffusion. Evaluated on DGIdb 4.0, it significantly outperforms state-of-the-art methods (achieving a +4.2% average AUC gain) and demonstrates strong generalization capability. It effectively models ternary heterogeneous relationships and supports cold-start prediction.

Technology Category

Application Category

📝 Abstract
Predicting drug-gene associations is crucial for drug development and disease treatment. While graph neural networks (GNN) have shown effectiveness in this task, they face challenges with data sparsity and efficient contrastive learning implementation. We introduce a graph diffusion network for drug-gene prediction (GDNDGP), a framework that addresses these limitations through two key innovations. First, it employs meta-path-based homogeneous graph learning to capture drug-drug and gene-gene relationships, ensuring similar entities share embedding spaces. Second, it incorporates a parallel diffusion network that generates hard negative samples during training, eliminating the need for exhaustive negative sample retrieval. Our model achieves superior performance on the DGIdb 4.0 dataset and demonstrates strong generalization capability on tripartite drug-gene-disease networks. Results show significant improvements over existing methods in drug-gene prediction tasks, particularly in handling complex heterogeneous relationships. The source code is publicly available at https://github.com/csjywu1/GDNDGP.
Problem

Research questions and friction points this paper is trying to address.

Predicts drug-gene associations for drug development
Improves data sparsity in graph neural networks
Enhances contrastive learning with hard negative samples
Innovation

Methods, ideas, or system contributions that make the work stand out.

meta-path-based homogeneous graph learning
parallel diffusion network for hard negatives
improved drug-gene prediction accuracy
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
Jiayang Wu
Jiayang Wu
Jinan University
AI for science
W
Wensheng Gan
College of Cyber Security, Jinan University, Guangzhou 510632, China; and Pazhou Lab, Guangzhou 510330, China
Philip S. Yu
Philip S. Yu
Professor of Computer Science, University of Illinons at Chicago
Data miningDatabasePrivacy