Biological Pathway Informed Models with Graph Attention Networks (GATs)

📅 2025-08-30
🤖 AI Summary
Problem: Existing genomic models typically treat genes as unstructured labels, or treat pathways merely as "gene sets," neglecting their intrinsic topological structure and regulatory interactions. Method: We propose a pathway-aware modeling framework based on Graph Attention Networks (GATs) that explicitly encodes intra-pathway regulatory relationships at the gene level using known biological pathway graphs. We further introduce an edge intervention mechanism that represents drug target perturbations, enabling dynamic rewiring of feedback loops. The method combines hierarchical pathway modeling, training on time-series mRNA expression data, and a mechanism-driven interpretability design. Contribution/Results: Experiments show an 81% reduction in MSE over an MLP baseline when predicting pathway dynamics under unseen treatment conditions. The model also recovers all five known interactions in the TP53-MDM2-MDM4 feedback loop, improving both cross-condition generalization and biological interpretability.

📝 Abstract
Biological pathways map gene-gene interactions that govern all human processes. Despite their importance, most ML models treat genes as unstructured tokens, discarding known pathway structure. The latest pathway-informed models capture pathway-pathway interactions, but still treat each pathway as a "bag of genes" via MLPs, discarding its topology and gene-gene interactions. We propose a Graph Attention Network (GAT) framework that models pathways at the gene level. We show that GATs generalize much better than MLPs, achieving an 81% reduction in MSE when predicting pathway dynamics under unseen treatment conditions. We further validate the correctness of our biological prior by encoding drug mechanisms via edge interventions, boosting model robustness. Finally, we show that our GAT model is able to correctly rediscover all five gene-gene interactions in the canonical TP53-MDM2-MDM4 feedback loop from raw time-series mRNA data, demonstrating potential to generate novel biological hypotheses directly from experimental data.
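
To make the gene-level idea concrete, here is a minimal NumPy sketch of one single-head graph attention layer applied to a toy three-gene graph. It is not the authors' implementation: the edge list, feature sizes, and random weights are illustrative assumptions, and `adjacency` and `gat_layer` are hypothetical helper names. The point is only that attention is computed along pathway edges, whereas an MLP over a "bag of genes" would ignore which gene regulates which.

```python
import numpy as np

GENES = ["TP53", "MDM2", "MDM4"]
# Directed (source, target) edges. Illustrative placeholders only,
# not the pathway graph used in the paper.
EDGES = [(0, 1), (1, 0), (2, 0), (1, 2)]

def adjacency(n, edges, self_loops=True):
    """adj[i, j] = 1 if node i receives a message from node j."""
    adj = np.zeros((n, n))
    for src, dst in edges:
        adj[dst, src] = 1.0
    if self_loops:
        adj[np.arange(n), np.arange(n)] = 1.0
    return adj

def gat_layer(h, adj, W, a, slope=0.2):
    """One single-head graph attention layer in the standard GAT form."""
    z = h @ W                                  # projected features, (n, d_out)
    d = z.shape[1]
    # e[i, j] = LeakyReLU(a^T [z_i || z_j]), split into two dot products
    e = (z @ a[:d])[:, None] + (z @ a[d:])[None, :]
    e = np.where(e > 0, e, slope * e)          # LeakyReLU
    e = np.where(adj > 0, e, -np.inf)          # attend only along graph edges
    e = e - e.max(axis=1, keepdims=True)       # numerically stable softmax
    alpha = np.exp(e)
    alpha = alpha / alpha.sum(axis=1, keepdims=True)
    return alpha @ z, alpha

rng = np.random.default_rng(0)
h = rng.normal(size=(3, 4))    # 4 placeholder expression features per gene
W = rng.normal(size=(4, 8))    # random stand-in for learned weights
a = rng.normal(size=(16,))     # random stand-in for the attention vector
adj = adjacency(len(GENES), EDGES)
out, alpha = gat_layer(h, adj, W, a)
```

Each row of `alpha` is a distribution over the genes that regulate that row's gene (plus itself), so attention weights can be read per edge. That per-edge readout is the kind of quantity one would inspect when asking which interactions a trained model relies on.
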
Problem

Research questions and friction points this paper is trying to address.

Modeling gene-level interactions within biological pathways
Capturing pathway topology beyond bag-of-genes representations
Predicting pathway dynamics under unseen treatment conditions
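
The "unseen treatment conditions" setting and the reported relative MSE reduction can be illustrated with a small sketch. The paper's actual evaluation protocol is not described on this page, so the split below is an assumption, the condition labels and numbers are made up purely for illustration, and `split_by_condition` and `relative_reduction` are hypothetical helper names.

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean squared error between two equally shaped arrays."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.mean(diff ** 2))

def relative_reduction(baseline_mse, model_mse):
    """Fraction by which model_mse is lower than baseline_mse."""
    return (baseline_mse - model_mse) / baseline_mse

def split_by_condition(conditions, held_out):
    """Boolean train/test masks; held-out conditions appear only in test."""
    conditions = np.asarray(conditions)
    test = np.isin(conditions, list(held_out))
    return ~test, test

# Made-up condition labels, one per sample (not data from the paper).
conditions = ["control", "drugA", "drugB", "drugA"]
train_mask, test_mask = split_by_condition(conditions, held_out={"drugB"})

# Made-up MSE values chosen only to show the arithmetic.
reduction = relative_reduction(baseline_mse=1.0, model_mse=0.19)
print(f"relative MSE reduction: {reduction:.0%}")
```

Under this reading, an 81% reduction means the GAT's test MSE is about 19% of the MLP baseline's MSE on the same held-out conditions.
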
Innovation

Methods, ideas, or system contributions that make the work stand out.

Graph Attention Networks model gene-level pathways
Edge interventions encode drug mechanisms for robustness
GATs capture gene-gene interactions from time-series data
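
One simple way to picture an edge intervention is as a change to the pathway's adjacency matrix before message passing. The sketch below assumes that a drug's mechanism can be expressed as a set of blocked (source, target) edges. That is an assumption for illustration, not the paper's stated mechanism; `build_adj` and `intervene` are hypothetical names, and the edges are placeholders.

```python
import numpy as np

GENES = ["TP53", "MDM2", "MDM4"]
IDX = {g: i for i, g in enumerate(GENES)}

def build_adj(edges):
    """adj[target, source] = 1 for each directed edge, with self-loops."""
    adj = np.eye(len(GENES))
    for src, dst in edges:
        adj[IDX[dst], IDX[src]] = 1.0
    return adj

def intervene(adj, blocked_edges):
    """Return a copy of adj with the given (source, target) edges removed."""
    out = adj.copy()
    for src, dst in blocked_edges:
        out[IDX[dst], IDX[src]] = 0.0
    return out

# Placeholder edges, not the graph used in the paper.
base = build_adj([("TP53", "MDM2"), ("MDM2", "TP53"),
                  ("MDM4", "TP53"), ("MDM2", "MDM4")])

# Hypothetical drug that blocks MDM2's action on TP53.
treated = intervene(base, [("MDM2", "TP53")])
```

A graph layer run on `treated` instead of `base` can no longer pass messages from MDM2 to TP53, so the treated condition is represented as a rewired feedback loop rather than as an extra input feature.
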
Authors

Gavin Wong
Yale University
Ping Shu Ho
NVIDIA AI Tech Center
Ivan Au Yeung
NVIDIA AI Tech Center
Ka Chun Cheung
NVIDIA AI Tech Center
Simon See
NVIDIA
Interests: applied mathematics, AI, machine learning, High Performance Computing, Simulation