SEANN: A Domain-Informed Neural Network for Epidemiological Insights

📅 2025-01-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Deep neural networks (DNNs) suffer from poor generalizability and limited interpretability when applied to epidemiological modeling under small-sample, high-noise data regimes. Method: We propose a domain-guided neural network that integrates pooled effect sizes (PES) derived from meta-analysis directly into the DNN’s differentiable loss function—enabling end-to-end, verifiable, knowledge-informed training. Our approach synergistically combines deep learning, eXplainable AI (XAI), and PES-driven custom loss design. Results: Experiments demonstrate substantial improvements in predictive generalization under data scarcity; learned feature relationships align more closely with epidemiological prior knowledge, thereby enhancing scientific credibility and mechanistic interpretability. This work establishes a novel paradigm for knowledge-guided, interpretable AI in public health modeling.

Technology Category

Application Category

📝 Abstract
In epidemiology, traditional statistical methods such as logistic regression, linear regression, and other parametric models are commonly employed to investigate associations between predictors and health outcomes. However, non-parametric machine learning techniques, such as deep neural networks (DNNs), coupled with explainable AI (XAI) tools, offer new opportunities for this task. Despite their potential, these methods face challenges due to the limited availability of high-quality, high-quantity data in this field. To address these challenges, we introduce SEANN, a novel approach for informed DNNs that leverages a prevalent form of domain-specific knowledge: Pooled Effect Sizes (PES). PESs are commonly found in published Meta-Analysis studies, in different forms, and represent a quantitative form of a scientific consensus. By direct integration within the learning procedure using a custom loss, we experimentally demonstrate significant improvements in the generalizability of predictive performances and the scientific plausibility of extracted relationships compared to a domain-knowledge agnostic neural network in a scarce and noisy data setting.
Problem

Research questions and friction points this paper is trying to address.

Epidemiology
Deep Neural Networks
Data Scarcity
Innovation

Methods, ideas, or system contributions that make the work stand out.

SEANN
Pooled Effect Size
Epidemiological Prediction
🔎 Similar Papers
No similar papers found.
J
Jean-Baptiste Guimbaud
Laboratory of Image Informatics and Information Systems (LIR IS), UMR5205 CNRS, F-69622 Villeurbanne, France; Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain; Meersens, Lyon, France
Marc Plantevit
Marc Plantevit
Professor at EPITA, Deputy head of EPITA Research Lab (LRE)
Data miningDatabasePattern MiningMachine LearningxAI
L
L'ea Maitre
Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
Rémy Cazabet
Rémy Cazabet
LIRIS, Lyon 1 university (UCBL), Lyon, France
Network ScienceMachine LearningComplex networksComplex systemsData Mining