Variational Inference for Evidential Deep Learning

📅 2026-05-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses critical limitations in traditional evidential deep learning, where the KL penalty suppresses evidence only for negative classes, leading to uncontrolled evidence growth and degraded uncertainty quantification, while the common choice of Dirichlet parameter α = e + 1 lacks theoretical justification. To overcome these issues, the authors reformulate evidential deep learning through variational inference, proposing the first VI-EDL framework. They derive an evidence lower bound (ELBO) that effectively regularizes evidence magnitude and establish a generalization error bound, rigorously proving that α = e + 1 minimizes this bound. The method achieves state-of-the-art performance on standard vision and medical datasets and demonstrates superior uncertainty quantification in out-of-distribution detection, noise identification, and autonomous driving tasks.
📝 Abstract
While Deep Neural Networks (DNNs) achieve remarkable performance, their tendency to produce overconfident predictions. Evidential Deep Learning (EDL) mitigates this by formulating predictions as a Dirichlet distribution over class probabilities to explicitly quantify epistemic uncertainty. However, we found that the conventional EDL suffers from two fundamental limitations: a Kullback-Leibler (KL) penalty that only suppresses the evidence of negative classes, producing excessively high evidence therefore decreasing the model's ability to quantify uncertainty, and an absence in theoretical guarantee of setting Dirichlet parameter $α=e+1$. In this paper, we propose a mathematically principled framework, Variational Inference Evidential Deep Learning (VI-EDL). By reformulating evidential learning through the lens of variational inference, we derive an Evidence Lower Bound (ELBO), which prevents the evidence from growing excessively. Theoretically, we rigorously establish a generalization bound and reveal how the predicted uncertainty, feature and network complexity affect this bound, and why setting $\boldsymbolα = \mathbf{e} + \mathbf{1}$ can minimize it. Extensive experiments on standard visual and medical datasets demonstrate that VI-EDL achieves state-of-the-art performance, showing excellent performance in out-of-distribution detection, noise detection and autonomous driving scenario. The code is available in https://github.com/seutjw/VI-EDL.
Problem

Research questions and friction points this paper is trying to address.

Evidential Deep Learning
epistemic uncertainty
Dirichlet distribution
overconfident predictions
Kullback-Leibler penalty
Innovation

Methods, ideas, or system contributions that make the work stand out.

Variational Inference
Evidential Deep Learning
Uncertainty Quantification
Dirichlet Distribution
Generalization Bound
J
Jiawei Tang
School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
X
Xinyan Du
School of Computer Science and Engineering, Southeast University, Nanjing 210096, China
Hui Liu
Hui Liu
City University of Hong Kong
AI SecurityTrustworthy AILLMFake News Detection
Junhui Hou
Junhui Hou
Department of Computer Science, City University of Hong Kong
Neural Spatial Computing
Y
Yuheng Jia
School of Computer Science and Engineering, Southeast University, Nanjing 210096, China, and also with the Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China