MUSE: Multimodal Uncertainty Quantification of State Estimation

📅 2026-05-17
📈 Citations: 0
Influential: 0
📄 PDF

career value

220K/year
🤖 AI Summary
This work addresses the challenges of uncertainty quantification, unreliable confidence estimation, and failure detection in visual-inertial odometry under heteroscedastic and multimodal conditions. To this end, we propose the first real-time learning framework based on the Mamba architecture for jointly modeling asynchronous multimodal sensor streams and estimating localization uncertainty. Our approach innovatively leverages Mamba’s powerful sequential modeling capabilities within multimodal state estimation, integrating a multi-source fusion strategy with a learning-driven uncertainty calibration mechanism to achieve efficient and reliable confidence quantification. Extensive experiments demonstrate that our method significantly outperforms existing approaches on both public and self-collected datasets, achieving notable advances in robustness and reliability, while ablation studies confirm the effectiveness of our core design choices.
📝 Abstract
Accurate visual state estimation has been a central topic in robotics with a wide range of applications in robot navigation, autonomous driving, and autonomous flight. Recent advances in robot perception have led to significant improvements in the accuracy and robustness of state estimation, yet a fundamental challenge remains in how to quantify and calibrate its precision, i.e., how confident we are in an estimate and whether failures can be detected. This issue is particularly pronounced in visual-inertial odometry (VIO), where the heteroscedastic and multimodal nature of the problem makes uncertainty quantification especially difficult. This paper introduces MUSE (Multimodal Uncertainty Quantification of State Estimation), a novel real-time learning-based framework that leverages the strong and efficient sequential modeling capacity of Mamba to estimate localization uncertainty from multiple asynchronous sensor streams. Experiments on both public and in-house datasets demonstrate that MUSE achieves superior reliability and robustness compared to existing uncertainty quantification methods, and ablation studies justify the benefits of its key design choices.
Problem

Research questions and friction points this paper is trying to address.

Uncertainty Quantification
State Estimation
Visual-Inertial Odometry
Multimodal
Heteroscedasticity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uncertainty Quantification
Visual-Inertial Odometry
Mamba Architecture
Multimodal Sensor Fusion
State Estimation
M
Minkyung Kim
Department of Mechanical Science and Engineering, University of Illinois Urbana-Champaign, Champaign, IL 61801, USA
H
Henry Che
Siebel School of Computing and Data Science, University of Illinois Urbana-Champaign, Champaign, IL 61801, USA
B
Bhargav Chandaka
Siebel School of Computing and Data Science, University of Illinois Urbana-Champaign, Champaign, IL 61801, USA
B
Bhumsitt Pramuanpornsatid
Siebel School of Computing and Data Science, University of Illinois Urbana-Champaign, Champaign, IL 61801, USA
C
Chengyu Yang
Department of Mechanical Science and Engineering, University of Illinois Urbana-Champaign, Champaign, IL 61801, USA
Sheng Cheng
Sheng Cheng
Postdoc, University of Illinois Urbana-Champaign
aerial roboticsoptimizationadaptive control
Xiaofeng Wang
Xiaofeng Wang
University of South Carolina
Robotics and ControlCyber-Physical SystemsAutonomous SystemsMachine Learning
N
Naira Hovakimyan
Department of Mechanical Science and Engineering, University of Illinois Urbana-Champaign, Champaign, IL 61801, USA
Shenlong Wang
Shenlong Wang
University of Illinois at Urbana-Champaign
Computer VisionRobot PerceptionAutonomous Driving