π€ AI Summary
This study addresses the challenge of collaborative modeling over heterogeneous, multi-source epidemiological dataβsome containing sensitive information and others protected via differential privacy. We propose the first privacy-preserving analytical framework integrating differential privacy, deep learning, and mechanistic infectious disease dynamics. Methodologically, we innovatively introduce differentially private synthetic financial data as auxiliary features and design a unified architecture capable of jointly modeling both raw and privacy-protected data, enabling end-to-end fusion and dynamic mechanism learning under rigorous privacy guarantees. Experiments demonstrate that, under privacy budgets Ξ΅ = 1β5, our framework reduces epidemic prediction error by 12.7%β23.4% and accurately infers transmission parameters (Rβ estimation error < 8.5%), significantly outperforming baseline models. Our core contribution is the first cross-domain transfer of differentially private synthetic data to epidemiological modeling, establishing a novel paradigm that jointly optimizes privacy preservation, predictive accuracy, and mechanistic interpretability.
π Abstract
It is now well understood that diverse datasets provide a lot of value in key epidemiology and public health analyses, such as forecasting and nowcasting, development of epidemic models, evaluation and design of interventions and resource allocation. Some of these datasets are often sensitive, and need adequate privacy protections. There are many models of privacy, but Differential Privacy (DP) has become a de facto standard because of its strong guarantees, without making models about adversaries. In this paper, we develop a framework the integrates deep learning and epidemic models to simultaneously perform epidemic forecasting and learning a mechanistic model of epidemic spread, while incorporating multiple datasets for these analyses, including some with DP guarantees. We demonstrate our framework using a realistic but synthetic financial dataset with DP; such a dataset has not been used in such epidemic analyses. We show that this dataset provides significant value in forecasting and learning an epidemic model, even when used with DP guarantees.