DiagnoLLM: A Hybrid Bayesian Neural Language Framework for Interpretable Disease Diagnosis

📅 2025-11-08

📈 Citations: 0

✨ Influential: 0

career value

180K/year

🤖 AI Summary

Clinical deployment of AI for Alzheimer’s disease (AD) diagnosis requires both high accuracy and interpretability to ensure clinician trust and patient understanding. Method: We propose a biologically grounded, multimodal explainable AI framework that integrates single-cell and bulk RNA-seq data via Gaussian process Bayesian unmixing (GP-unmix); incorporates expression quantitative trait locus (eQTL)-informed regulatory priors to guide a neural classifier; and employs a large language model (LLM) as a controllable post-hoc reasoning module to generate personalized, actionable diagnostic reports—avoiding end-to-end black-box prediction. Contribution/Results: The framework achieves 88.0% diagnostic accuracy on an independent validation cohort. Human expert evaluation confirms the reports’ high factual accuracy, biological plausibility, and adaptability to both clinicians and patients. By jointly embedding domain-specific biological knowledge and human-centered explanation mechanisms, our approach establishes a new paradigm for trustworthy, clinically deployable AI in neurodegenerative disease diagnosis.

Technology Category

Application Category

📝 Abstract

Building trustworthy clinical AI systems requires not only accurate predictions but also transparent, biologically grounded explanations. We present exttt{DiagnoLLM}, a hybrid framework that integrates Bayesian deconvolution, eQTL-guided deep learning, and LLM-based narrative generation for interpretable disease diagnosis. DiagnoLLM begins with GP-unmix, a Gaussian Process-based hierarchical model that infers cell-type-specific gene expression profiles from bulk and single-cell RNA-seq data while modeling biological uncertainty. These features, combined with regulatory priors from eQTL analysis, power a neural classifier that achieves high predictive performance in Alzheimer's Disease (AD) detection (88.0% accuracy). To support human understanding and trust, we introduce an LLM-based reasoning module that translates model outputs into audience-specific diagnostic reports, grounded in clinical features, attribution signals, and domain knowledge. Human evaluations confirm that these reports are accurate, actionable, and appropriately tailored for both physicians and patients. Our findings show that LLMs, when deployed as post-hoc reasoners rather than end-to-end predictors, can serve as effective communicators within hybrid diagnostic pipelines.

Problem

Research questions and friction points this paper is trying to address.

Inferring cell-type-specific gene expression from bulk and single-cell RNA-seq data

Achieving accurate disease detection while providing transparent biological explanations

Generating audience-specific diagnostic reports for physicians and patients

Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian deconvolution infers cell-type gene expression

Neural classifier uses eQTL priors for disease detection

LLM generates audience-specific diagnostic reports from outputs

🔎 Similar Papers

Large Language Models for Disease Diagnosis: A Scoping Review