Event-based Egocentric Human Pose Estimation in Dynamic Environment

📅 2025-05-28

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This work addresses human pose estimation from a forward-facing, head-mounted event camera in dynamic environments, a setting where conventional RGB-based methods degrade under low illumination and motion blur. Method: We define the new task of egocentric human pose estimation with a front-facing, head-mounted event camera and propose D-EventEgo, a framework that first estimates head poses and then uses them as conditions to generate full-body poses. A Motion Segmentation Module uses the event stream to segment dynamic objects and remove their events, so that head pose is estimated from background events rather than from clutter. Contribution/Results: We construct EgoEvent, a synthetic dynamic event dataset for this task, built upon EgoBody. On this synthetic dataset, our method outperforms the baseline on four of five evaluation metrics in dynamic environments.

📝 Abstract

Estimating human pose using a front-facing egocentric camera is essential for applications such as sports motion analysis, VR/AR, and AI for wearable devices. However, many existing methods rely on RGB cameras and do not account for low-light environments or motion blur. Event-based cameras have the potential to address these challenges. In this work, we introduce a novel task of human pose estimation using a front-facing event-based camera mounted on the head and propose D-EventEgo, the first framework for this task. The proposed method first estimates head poses, which are then used as conditions to generate body poses. However, dynamic objects produce events that mix with background events and can reduce head pose estimation accuracy. Therefore, we introduce the Motion Segmentation Module to remove dynamic objects and extract background information. Extensive experiments on our synthetic event-based dataset, derived from EgoBody, demonstrate that our approach outperforms our baseline in four out of five evaluation metrics in dynamic environments.

Problem

Research questions and friction points this paper is trying to address.

Estimating human pose with egocentric event cameras in dynamic environments

Addressing low-light and motion blur challenges in pose estimation

Improving head pose accuracy by removing dynamic object interference

Innovation

Methods, ideas, or system contributions that make the work stand out.

Event-based camera for pose estimation

Motion Segmentation Module removes dynamic objects

Head pose conditions generate body poses
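
The idea of removing dynamic-object events before head pose estimation can be illustrated with a minimal sketch. This is not the paper's Motion Segmentation Module: the function name `accumulate_background_events`, the `(x, y, t, p)` event layout, and the assumption that a per-pixel dynamic-object mask is already available are all hypothetical choices made for illustration. The sketch only shows how events falling on masked pixels could be dropped and the remaining background events accumulated into a two-channel polarity count frame.

```python
import numpy as np


def accumulate_background_events(events, dynamic_mask):
    """Accumulate events from static (background) pixels into a count frame.

    Assumed, hypothetical layout (not taken from the paper):
      events       -- (N, 4) array with rows (x, y, t, p), p > 0 for positive polarity
      dynamic_mask -- (H, W) boolean array, True where a dynamic object was segmented
    Returns an (H, W, 2) integer array: channel 0 counts negative events,
    channel 1 counts positive events, restricted to unmasked pixels.
    """
    height, width = dynamic_mask.shape
    x = events[:, 0].astype(np.intp)
    y = events[:, 1].astype(np.intp)
    polarity = (events[:, 3] > 0).astype(np.intp)

    # Keep only events whose pixel is NOT flagged as belonging to a dynamic object.
    keep = ~dynamic_mask[y, x]

    frame = np.zeros((height, width, 2), dtype=np.int32)
    # np.add.at performs unbuffered accumulation, so repeated pixels are counted correctly.
    np.add.at(frame, (y[keep], x[keep], polarity[keep]), 1)
    return frame
```

In a pipeline like the one described, a frame of this kind would stand in for the "background information" passed to the head pose estimator; how the mask itself is predicted from events is the part the paper's module addresses and is left out here.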
