LiDAR-BIND-T: Improving SLAM with Temporally Consistent Cross-Modal LiDAR Reconstruction

📅 2025-09-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses temporal inconsistency and reconstruction distortion arising from cross-modal fusion (e.g., radar/sonar → LiDAR) in LiDAR-based SLAM. To this end, we propose a temporally aware multimodal fusion framework. Methodologically, it incorporates a temporal embedding alignment module, a motion-aligned loss function, and a windowed temporal fusion mechanism; integrated with Cartographer atop LiDAR-BIND to enable end-to-end SLAM optimization. A key contribution is the introduction of novel temporal quality metrics—including FVMD—to ensure plug-and-play compatibility and robust temporal calibration. Experiments demonstrate significant improvements in pose estimation stability and occupancy map accuracy: the absolute trajectory error (ATE) is reduced by 21.3% on average. Moreover, cross-modal reconstruction achieves superior spatiotemporal consistency and geometric fidelity compared to state-of-the-art baselines.

Technology Category

Application Category

📝 Abstract
This paper extends LiDAR-BIND, a modular multi-modal fusion framework that binds heterogeneous sensors (radar, sonar) to a LiDAR-defined latent space, with mechanisms that explicitly enforce temporal consistency. We introduce three contributions: (i) temporal embedding similarity that aligns consecutive latents, (ii) a motion-aligned transformation loss that matches displacement between predictions and ground truth LiDAR, and (iii) windows temporal fusion using a specialised temporal module. We further update the model architecture to better preserve spatial structure. Evaluations on radar/sonar-to-LiDAR translation demonstrate improved temporal and spatial coherence, yielding lower absolute trajectory error and better occupancy map accuracy in Cartographer-based SLAM (Simultaneous Localisation and Mapping). We propose different metrics based on the Fréchet Video Motion Distance (FVMD) and a correlation-peak distance metric providing practical temporal quality indicators to evaluate SLAM performance. The proposed temporal LiDAR-BIND, or LiDAR-BIND-T, maintains plug-and-play modality fusion while substantially enhancing temporal stability, resulting in improved robustness and performance for downstream SLAM.
Problem

Research questions and friction points this paper is trying to address.

Improving temporal consistency in cross-modal LiDAR reconstruction
Enhancing SLAM performance with better spatial-temporal coherence
Maintaining plug-and-play fusion while boosting temporal stability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Temporal embedding aligns consecutive latent spaces
Motion-aligned transformation loss matches displacement
Windowed temporal fusion with specialized module
🔎 Similar Papers
N
Niels Balemans
IDLab - Faculty of Applied Engineering, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium
Ali Anwar
Ali Anwar
Assistant Professor, University of Minnesota
Distributed SystemsMachine Learning SystemsStorage SystemsCloudHPC
J
Jan Steckel
Cosys-Lab - Faculty of Applied Engineering, University of Antwerp, 2000 Antwerp, Belgium; Flanders Make Strategic Research Centre, Lommel, Belgium
S
Siegfried Mercelis
IDLab - Faculty of Applied Engineering, University of Antwerp - imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium