Cross-Modal Reinforcement Learning for Navigation with Degraded Depth Measurements

📅 2026-03-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of robust robotic navigation under severe depth sensor degradation caused by low illumination or reflective surfaces. To mitigate this issue, the authors propose a cross-modal reinforcement learning framework that leverages a cross-modal Wasserstein autoencoder to enforce consistency between depth maps and grayscale images in a shared latent space. This enables the system to infer depth-relevant features from grayscale imagery when depth data is unreliable or missing. Notably, this approach is the first to effectively integrate cross-modally aligned latent representations into navigation policies. Extensive experiments demonstrate significant improvements in navigation robustness under depth-degraded conditions, both in simulation and real-world environments, with successful zero-shot sim-to-real transfer achieved without additional fine-tuning.

Technology Category

Application Category

📝 Abstract
This paper presents a cross-modal learning framework that exploits complementary information from depth and grayscale images for robust navigation. We introduce a Cross-Modal Wasserstein Autoencoder that learns shared latent representations by enforcing cross-modal consistency, enabling the system to infer depth-relevant features from grayscale observations when depth measurements are corrupted. The learned representations are integrated with a Reinforcement Learning-based policy for collision-free navigation in unstructured environments when depth sensors experience degradation due to adverse conditions such as poor lighting or reflective surfaces. Simulation and real-world experiments demonstrate that our approach maintains robust performance under significant depth degradation and successfully transfers to real environments.
Problem

Research questions and friction points this paper is trying to address.

cross-modal learning
depth degradation
reinforcement learning
navigation
sensor fusion
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-Modal Learning
Wasserstein Autoencoder
Reinforcement Learning
Depth Degradation
Robust Navigation
🔎 Similar Papers
No similar papers found.
O
Omkar Sawant
Department of Engineering Cybernetics, Norwegian University of Science and Technology (NTNU), Norway
L
Luca Zanatta
Department of Engineering Cybernetics, Norwegian University of Science and Technology (NTNU), Norway
Grzegorz Malczyk
Grzegorz Malczyk
Autonomous Robots Lab, NTNU
RoboticsArtificial IntelligencePath PlanningControlState Estimation
Kostas Alexis
Kostas Alexis
NTNU - Norwegian University of Science and Technology
RoboticsUnmanned Aerial VehiclesControlPath PlanningPerception