L2Calib: $SE(3)$-Manifold Reinforcement Learning for Robust Extrinsic Calibration with Degenerate Motion Resilience

📅 2025-08-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing extrinsic calibration methods rely on structured targets or strong ego-motion excitation, limiting their applicability in real-world scenarios; online calibration methods, in contrast, often fail under weak excitation. This paper proposes the first reinforcement learning–based framework for online extrinsic calibration, formulating calibration as a sequential decision-making problem over $SE(3)$ pose optimization. We innovatively model 3D rotations using the Bingham distribution to rigorously preserve quaternion antipodal symmetry. A trajectory-alignment reward function and an automatic data filtering module are designed to ensure robust convergence without structured targets or strong motion. The method requires no prior knowledge of initial extrinsics and supports diverse platforms—including UAVs, autonomous vehicles, and handheld devices—using only routine operational data. Experimental results demonstrate superior accuracy, convergence stability, and generalizability compared to conventional approaches.

Technology Category

Application Category

📝 Abstract
Extrinsic calibration is essential for multi-sensor fusion, existing methods rely on structured targets or fully-excited data, limiting real-world applicability. Online calibration further suffers from weak excitation, leading to unreliable estimates. To address these limitations, we propose a reinforcement learning (RL)-based extrinsic calibration framework that formulates extrinsic calibration as a decision-making problem, directly optimizes $SE(3)$ extrinsics to enhance odometry accuracy. Our approach leverages a probabilistic Bingham distribution to model 3D rotations, ensuring stable optimization while inherently retaining quaternion symmetry. A trajectory alignment reward mechanism enables robust calibration without structured targets by quantitatively evaluating estimated tightly-coupled trajectory against a reference trajectory. Additionally, an automated data selection module filters uninformative samples, significantly improving efficiency and scalability for large-scale datasets. Extensive experiments on UAVs, UGVs, and handheld platforms demonstrate that our method outperforms traditional optimization-based approaches, achieving high-precision calibration even under weak excitation conditions. Our framework simplifies deployment on diverse robotic platforms by eliminating the need for high-quality initial extrinsics and enabling calibration from routine operating data. The code is available at https://github.com/APRIL-ZJU/learn-to-calibrate.
Problem

Research questions and friction points this paper is trying to address.

Robust extrinsic calibration for multi-sensor fusion
Overcoming weak excitation in online calibration
Eliminating need for structured targets or full excitation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning optimizes SE(3) extrinsics directly
Bingham distribution models 3D rotations stably
Automated data selection improves efficiency and scalability
🔎 Similar Papers
No similar papers found.
Baorun Li
Baorun Li
Zhejiang university
roboticsmanipulationslam
C
Chengrui Zhu
Institute of Cyber-Systems and Control, Zhejiang University, China
Siyi Du
Siyi Du
PhD Student at Imperial College London
Deep LearningMultimodal LearningBiomedical Imaging
B
Bingran Chen
Institute of Cyber-Systems and Control, Zhejiang University, China
J
Jie Ren
Institute of Cyber-Systems and Control, Zhejiang University, China
W
Wenfei Wang
Zhejiang Guozi Robotics Technology Co., Ltd.
Y
Yong Liu
Institute of Cyber-Systems and Control, Zhejiang University, China; State Key Laboratory of Industrial Control Technology, Zhejiang University, China
Jiajun Lv
Jiajun Lv
Zhejiang University
SLAM