Large Language Models Meet Biomedical Knowledge Graphs for Mechanistically Grounded Therapeutic Prioritization

📅 2026-04-17
📈 Citations: 0
Influential: 0
📄 PDF

career value

182K/year
🤖 AI Summary
This study addresses the limitation of existing drug repositioning approaches, which often struggle to distinguish biologically plausible candidates due to their reliance on historical associations. To overcome this, the authors propose DrugKLM, a novel hybrid framework that uniquely integrates the structural knowledge from knowledge graphs with the mechanistic reasoning capabilities of large language models. By combining knowledge graph embeddings, transcriptomic phenotype alignment, and expert-informed clinical context, DrugKLM generates interpretable and clinically relevant therapeutic hypotheses. The method consistently outperforms approaches based solely on knowledge graphs or language models across multiple benchmarks. Its confidence scores show significant concordance with survival-associated transcriptional signatures across 12 TCGA cancer types, and expert evaluation confirms that DrugKLM prioritizes mechanistically sound and clinically promising candidate drugs in five distinct cancers.

Technology Category

Application Category

📝 Abstract
Drug repurposing is often framed as a candidate identification task, but existing approaches provide limited guidance for distinguishing biologically plausible candidates from historically well-connected ones. Here we introduce DrugKLM, a hybrid framework that integrates biomedical knowledge graph structure with large language model-based mechanistic reasoning to enable mechanistically grounded therapeutic prioritization. Across benchmark datasets, DrugKLM outperforms knowledge graph-only and language model-only baselines, including TxGNN. Beyond improved recall, DrugKLM confidence scores exhibit functional alignment with molecular phenotypes: higher scores are associated with transcriptional signatures linked to improved survival across 12 TCGA cancers. The scoring framework preferentially captures biologically perturbational signals rather than historical indication patterns. Expert curation across five cancers further reveals systematic differences in prioritization behavior, with DrugKLM elevating candidates supported by coherent mechanistic rationale and disease-specific clinical context. Together, these results establish DrugKLM as an evidence-integrative framework that translates heterogeneous biomedical data into mechanistically interpretable and clinically grounded therapeutic hypotheses.
Problem

Research questions and friction points this paper is trying to address.

drug repurposing
biomedical knowledge graphs
mechanistic reasoning
therapeutic prioritization
biological plausibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

Drug Repurposing
Knowledge Graph
Large Language Model
Mechanistic Reasoning
Therapeutic Prioritization
C
Chih-Hsuan Wei
Division of Intramural Research (DIR), National Library of Medicine (NLM), National Institutes of Health (NIH); Bethesda, MD 20894, USA
C
Chi-Ping Day
Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD 20894, USA
Zhizheng Wang
Zhizheng Wang
Postdoc, Division of Intramural Research (DIR), NLM, NIH
Large Language ModelsRepresentation LearningGraph Data MiningBioinformatics
C
Christine C. Alewine
Dartmouth Health, Dartmouth Hitchcock Medical Center, Lebanon, NH, USA
B
Betty Tyler
Hunterian Neurosurgical Laboratory, Johns Hopkins School of Medicine, Baltimore, MD 21231, USA
H
Hasan Slika
Hunterian Neurosurgical Laboratory, Johns Hopkins School of Medicine, Baltimore, MD 21231, USA
D
David Saraf
Hunterian Neurosurgical Laboratory, Johns Hopkins School of Medicine, Baltimore, MD 21231, USA
C
Chin-Hsien Tai
National Cancer Institute (NCI), National Institutes of Health (NIH), Bethesda, MD 20894, USA
J
Joey Chan
Division of Intramural Research (DIR), National Library of Medicine (NLM), National Institutes of Health (NIH); Bethesda, MD 20894, USA
Robert Leaman
Robert Leaman
Staff Scientist, NCBI/NLM/NIH
Natural Language ProcessingMachine Learning
Zhiyong Lu
Zhiyong Lu
Senior Investigator, NLM; Adjunct Professor of CS, UIUC
BioNLPBiomedical InformaticsMedical AIArtificial Intelligence