EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images

πŸ“… 2025-03-06
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing semantic mapping methods in unstructured environments suffer from overconfident semantic predictions, sparse and noisy depth perception, and inconsistent map representations. To address these issues, this paper proposes EvidKimeraβ€”a novel framework that, for the first time, integrates evidential uncertainty modeling into the end-to-end semantic surface mapping pipeline. We design an evidential depth loss to jointly optimize depth estimation and semantic segmentation under a principled uncertainty-aware objective. EvidKimera unifies monocular depth estimation, semantic segmentation, multi-task learning, and Kimera SLAM. On NYU Depth V2, it yields well-calibrated uncertainty estimates. With zero-shot transfer to ScanNetV2, EvidKimera achieves significantly higher accuracy and consistency in semantic surface maps compared to the baseline Kimera system.

Technology Category

Application Category

πŸ“ Abstract
For scene understanding in unstructured environments, an accurate and uncertainty-aware metric-semantic mapping is required to enable informed action selection by autonomous systems.Existing mapping methods often suffer from overconfident semantic predictions, and sparse and noisy depth sensing, leading to inconsistent map representations. In this paper, we therefore introduce EvidMTL, a multi-task learning framework that uses evidential heads for depth estimation and semantic segmentation, enabling uncertainty-aware inference from monocular RGB images. To enable uncertainty-calibrated evidential multi-task learning, we propose a novel evidential depth loss function that jointly optimizes the belief strength of the depth prediction in conjunction with evidential segmentation loss. Building on this, we present EvidKimera, an uncertainty-aware semantic surface mapping framework, which uses evidential depth and semantics prediction for improved 3D metric-semantic consistency. We train and evaluate EvidMTL on the NYUDepthV2 and assess its zero-shot performance on ScanNetV2, demonstrating superior uncertainty estimation compared to conventional approaches while maintaining comparable depth estimation and semantic segmentation. In zero-shot mapping tests on ScanNetV2, EvidKimera outperforms Kimera in semantic surface mapping accuracy and consistency, highlighting the benefits of uncertainty-aware mapping and underscoring its potential for real-world robotic applications.
Problem

Research questions and friction points this paper is trying to address.

Overconfident semantic predictions in existing mapping methods
Sparse and noisy depth sensing in unstructured environments
Inconsistent map representations for autonomous systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Evidential heads for depth and semantic tasks
Novel evidential depth loss function
EvidKimera for improved 3D semantic consistency
πŸ”Ž Similar Papers
No similar papers found.