Keypoint-based Dynamic Object 6-DoF Pose Tracking via Event Camera

📅 2026-04-25
📈 Citations: 0
Influential: 0
📄 PDF

career value

213K/year
🤖 AI Summary
This work addresses the challenge of 6-degree-of-freedom pose estimation for dynamic objects under adverse conditions such as motion blur, sensor noise, and low illumination by proposing a novel event camera–based approach. Leveraging the high dynamic range and low latency of event cameras, the method extracts keypoints from time surfaces and achieves robust tracking by integrating event polarity, spatial coordinates, and local event density—a contribution introduced here for the first time in keypoint tracking. Furthermore, an efficient 2D–3D keypoint hashing scheme is designed to enable lightweight correspondence mapping, which, combined with the EPnP algorithm, yields accurate pose estimates. Evaluated on both synthetic and real-world datasets, the proposed method significantly outperforms existing approaches, achieving notable advances in both accuracy and robustness.

Technology Category

Application Category

📝 Abstract
Accurate 6-DoF pose estimation of objects is critical for robots to perform precise manipulation tasks. However, for dynamic object pose estimation, conventional camera-based approaches face several major challenges, such as motion blur, sensor noise, and low-light limitation. To address these issues, we employ event cameras, whose high dynamic range and low latency offer a promising solution. Furthermore, we propose a keypoint-based detection and tracking approach for dynamic object pose estimation. Firstly, a keypoint detection network is constructed to extract keypoints from the time surface generated by the event stream. Subsequently, the polarity and spatial coordinates of the events are leveraged, and the event density in the vicinity of each keypoint is utilized to achieve continuous keypoint tracking. Finally, a hash mapping is established between the 2D keypoints and the 3D model keypoints, and the EPnP algorithm is employed to estimate the 6-DoF pose. Experimental results demonstrate that, whether in simulated or real event environments, the proposed method outperforms the event-based state-of-the-art methods in terms of both accuracy and robustness.
Problem

Research questions and friction points this paper is trying to address.

6-DoF pose estimation
dynamic object
event camera
motion blur
low-light
Innovation

Methods, ideas, or system contributions that make the work stand out.

event camera
6-DoF pose estimation
keypoint tracking
time surface
EPnP
Z
Zhe Wang
ShanghaiTech Automation and Robotics (STAR) Center, School of Information Science and Technology, ShanghaiTech University, Shanghai, 201210, China
Q
Qijin Song
ShanghaiTech Automation and Robotics (STAR) Center, School of Information Science and Technology, ShanghaiTech University, Shanghai, 201210, China
Zihao Li
Zihao Li
China University of Geoscience, Wuhan
Computer VisionRemote SensingDeep Learning
Jingyu Xiao
Jingyu Xiao
Tsinghua University
Data MiningLarge Language ModelsComputer NetworkMLLM4Code
W
Weibang Bai
ShanghaiTech Automation and Robotics (STAR) Center, School of Information Science and Technology, ShanghaiTech University, Shanghai, 201210, China