LiDAR-BIND-T: Improving SLAM with Temporally Consistent Cross-Modal LiDAR Reconstruction

📅 2025-09-06

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

This work addresses temporal inconsistency and reconstruction distortion arising from cross-modal fusion (e.g., radar/sonar → LiDAR) in LiDAR-based SLAM. To this end, we propose a temporally aware multimodal fusion framework. Methodologically, it incorporates a temporal embedding alignment module, a motion-aligned loss function, and a windowed temporal fusion mechanism; integrated with Cartographer atop LiDAR-BIND to enable end-to-end SLAM optimization. A key contribution is the introduction of novel temporal quality metrics—including FVMD—to ensure plug-and-play compatibility and robust temporal calibration. Experiments demonstrate significant improvements in pose estimation stability and occupancy map accuracy: the absolute trajectory error (ATE) is reduced by 21.3% on average. Moreover, cross-modal reconstruction achieves superior spatiotemporal consistency and geometric fidelity compared to state-of-the-art baselines.

Technology Category

Application Category

📝 Abstract

This paper extends LiDAR-BIND, a modular multi-modal fusion framework that binds heterogeneous sensors (radar, sonar) to a LiDAR-defined latent space, with mechanisms that explicitly enforce temporal consistency. We introduce three contributions: (i) temporal embedding similarity that aligns consecutive latents, (ii) a motion-aligned transformation loss that matches displacement between predictions and ground truth LiDAR, and (iii) windows temporal fusion using a specialised temporal module. We further update the model architecture to better preserve spatial structure. Evaluations on radar/sonar-to-LiDAR translation demonstrate improved temporal and spatial coherence, yielding lower absolute trajectory error and better occupancy map accuracy in Cartographer-based SLAM (Simultaneous Localisation and Mapping). We propose different metrics based on the Fréchet Video Motion Distance (FVMD) and a correlation-peak distance metric providing practical temporal quality indicators to evaluate SLAM performance. The proposed temporal LiDAR-BIND, or LiDAR-BIND-T, maintains plug-and-play modality fusion while substantially enhancing temporal stability, resulting in improved robustness and performance for downstream SLAM.

Problem

Research questions and friction points this paper is trying to address.

Improving temporal consistency in cross-modal LiDAR reconstruction

Enhancing SLAM performance with better spatial-temporal coherence

Maintaining plug-and-play fusion while boosting temporal stability

Innovation

Methods, ideas, or system contributions that make the work stand out.

Temporal embedding aligns consecutive latent spaces

Motion-aligned transformation loss matches displacement

Windowed temporal fusion with specialized module

🔎 Similar Papers

BiCo-Fusion: Bidirectional Complementary LiDAR-Camera Fusion for Semantic- and Spatial-Aware 3D Object Detection