Torch-Uncertainty: A Deep Learning Framework for Uncertainty Quantification

📅 2025-11-13

🤖 AI Summary

Deep learning models often lack reliable uncertainty quantification (UQ). Existing UQ methods are fragmented and evaluated under inconsistent protocols, which hinders trustworthy deployment in safety-critical applications. To address this, we introduce Torch-Uncertainty, a modular, open-source UQ framework built on PyTorch and Lightning that supports classification, segmentation, and regression tasks. It integrates UQ techniques, including Bayesian neural networks, deep ensembles, Monte Carlo Dropout, and temperature scaling, alongside standardized evaluation metrics (e.g., Expected Calibration Error, Brier score, and AUROC for uncertainty). The framework adopts a decoupled architecture, so UQ methods can be swapped in and evaluated through automated pipelines. Experiments benchmark a diverse set of UQ methods across the three task families, with the aim of lowering the barrier to UQ adoption and improving reproducibility.
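To make two of the metrics named above concrete, here is a minimal sketch of the Brier score and an equal-width-bin Expected Calibration Error in plain Python. It is not Torch-Uncertainty's implementation or API; the function names `brier_score` and `expected_calibration_error`, the `n_bins` default, and the binning scheme are assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def brier_score(probs: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Mean squared distance between predicted probabilities and one-hot labels.

    Illustrative helper (hypothetical name), multiclass form summed over classes.
    """
    total = 0.0
    for p, y in zip(probs, labels):
        total += sum((p_k - (1.0 if k == y else 0.0)) ** 2 for k, p_k in enumerate(p))
    return total / len(labels)


def expected_calibration_error(
    probs: Sequence[Sequence[float]], labels: Sequence[int], n_bins: int = 10
) -> float:
    """ECE over top-label confidence with equal-width bins (an assumed binning choice)."""
    bins: List[List[Tuple[float, float]]] = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        p = list(p)
        conf = max(p)                # confidence of the predicted class
        pred = p.index(conf)         # predicted class index
        idx = min(int(conf * n_bins), n_bins - 1)  # clamp conf == 1.0 into the last bin
        bins[idx].append((conf, 1.0 if pred == y else 0.0))

    n = len(labels)
    ece = 0.0
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(a for _, a in b) / len(b)
        # Weight each bin's |accuracy - confidence| gap by its share of samples.
        ece += (len(b) / n) * abs(accuracy - avg_conf)
    return ece


if __name__ == "__main__":
    toy_probs = [[0.95, 0.05], [0.85, 0.15], [0.25, 0.75], [0.65, 0.35]]
    toy_labels = [0, 1, 1, 0]
    print("Brier:", brier_score(toy_probs, toy_labels))
    print("ECE:", expected_calibration_error(toy_probs, toy_labels))
```

Both metrics are lower-is-better. The Brier score penalizes every class probability, while this ECE variant only looks at the top-label confidence, so the two can rank models differently.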

📝 Abstract

Deep Neural Networks (DNNs) have demonstrated remarkable performance across various domains, including computer vision and natural language processing. However, they often struggle to accurately quantify the uncertainty of their predictions, limiting their broader adoption in critical real-world applications. Uncertainty Quantification (UQ) for Deep Learning seeks to address this challenge by providing methods to improve the reliability of uncertainty estimates. Although numerous techniques have been proposed, a unified tool offering a seamless workflow to evaluate and integrate these methods remains lacking. To bridge this gap, we introduce Torch-Uncertainty, a PyTorch and Lightning-based framework designed to streamline DNN training and evaluation with UQ techniques and metrics. In this paper, we outline the foundational principles of our library and present comprehensive experimental results that benchmark a diverse set of UQ methods across classification, segmentation, and regression tasks. Our library is available at https://github.com/ENSTA-U2IS-AI/Torch-Uncertainty

Problem

Research questions and friction points this paper is trying to address.

Deep Neural Networks struggle with accurate uncertainty quantification

Lack of a unified framework for evaluating uncertainty quantification methods

Need for a streamlined workflow for applying uncertainty techniques across diverse tasks

Innovation

Methods, ideas, or system contributions that make the work stand out.

PyTorch-based framework for uncertainty quantification

Streamlines DNN training with UQ techniques

Benchmarks UQ methods across multiple tasks
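As an illustration of the kind of uncertainty estimate such benchmarks compare, the sketch below averages class probabilities from several predictors and splits the resulting uncertainty into predictive entropy and mutual information. The same computation applies to the members of a deep ensemble or to repeated stochastic forward passes in Monte Carlo Dropout. The names `entropy` and `ensemble_uncertainty` are hypothetical and do not come from the Torch-Uncertainty library.

```python
import math
from typing import Dict, List, Sequence, Union


def entropy(p: Sequence[float]) -> float:
    """Shannon entropy in nats; zero-probability classes contribute nothing."""
    return -sum(p_k * math.log(p_k) for p_k in p if p_k > 0.0)


def ensemble_uncertainty(
    member_probs: Sequence[Sequence[float]],
) -> Dict[str, Union[float, List[float]]]:
    """Summarize uncertainty for ONE input from several probability vectors.

    member_probs[m][k] is the probability of class k according to member m
    (an ensemble member or one MC Dropout forward pass).
    """
    n_members = len(member_probs)
    n_classes = len(member_probs[0])

    # Ensemble prediction: average the members' class probabilities.
    mean_probs = [
        sum(member[k] for member in member_probs) / n_members
        for k in range(n_classes)
    ]

    # Total uncertainty: entropy of the averaged prediction.
    predictive_entropy = entropy(mean_probs)
    # Average uncertainty of the individual members.
    expected_entropy = sum(entropy(member) for member in member_probs) / n_members
    # Their gap measures disagreement between members.
    mutual_information = predictive_entropy - expected_entropy

    return {
        "mean_probs": mean_probs,
        "predictive_entropy": predictive_entropy,
        "mutual_information": mutual_information,
    }


if __name__ == "__main__":
    agreeing = [[0.5, 0.5], [0.5, 0.5]]      # members agree the input is ambiguous
    disagreeing = [[1.0, 0.0], [0.0, 1.0]]   # members are confident but contradict each other
    print(ensemble_uncertainty(agreeing))
    print(ensemble_uncertainty(disagreeing))
```

The two toy inputs have the same averaged prediction and therefore the same predictive entropy, but only the second has non-zero mutual information, which is why the two quantities are often reported separately.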
