Agentic Pipeline for Self-Synchronized Multiview Joint Angle Monitoring in Uncalibrated Environments

📅 2026-05-14
📈 Citations: 0
Influential: 0
📄 PDF

career value

197K/year
🤖 AI Summary
This study addresses the challenge of markerless, multi-view motion capture for spinal cord injury patients in home environments without camera calibration or hardware synchronization. To this end, the authors propose an agent-based multi-view video processing framework that, for the first time, integrates multimodal large language models with an agent mechanism into uncalibrated multi-view motion analysis. The approach enables self-synchronization of videos, consistent cross-view target tracking, and self-validation, while combining monocular 2D pose estimation with uncalibrated geometric optimization to robustly extract joint angles. Experimental results demonstrate an average absolute error of 5.97° ± 2.36° in joint angle estimation compared to a Vicon system, with a Pearson correlation coefficient of 0.962 ± 0.014, significantly reducing reliance on traditional calibration and synchronization hardware.
📝 Abstract
Kinematic monitoring plays a critical role in long-term rehabilitation for patients with spinal cord injury (SCI), where multi-view markerless motion capture methods have shown significant potential. However, owing to the reliance on calibration and the difficulty of achieving multi-view synchronization, their deployment in patient self-deployed environments remains challenging. In this work, we propose an agentic pipeline for self-synchronized multi-view joint angle monitoring in uncalibrated environments using two cameras without hardware triggers. The Multimodal large language models enable automatic video synchronization and agent-driven self-verification. State-of-the-art monocular 2D pose estimation models are employed to extract candidate poses, where an agent-based selection mechanism is then applied to automatically identify and track the target subject, thereby producing consistent 2D poses in the presence of multiple individuals and occlusions. Such 2D poses are optimized to estimate joint angles from uncalibrated multi-view pose sequences, ensuring interpretability through explicit geometric modeling. Validation against Vicon system demonstrated the strong performance, achieving an MAE of $5.97^\circ \pm 2.36^\circ$ and a Pearson correlation coefficient of $0.962 \pm 0.014$. The proposed method is expected to provide a practical, patient self-deployable system to perform daily kinematic monitoring in uncalibrated home environments.
Problem

Research questions and friction points this paper is trying to address.

multi-view synchronization
uncalibrated environments
markerless motion capture
joint angle monitoring
self-deployable system
Innovation

Methods, ideas, or system contributions that make the work stand out.

agentic pipeline
self-synchronization
uncalibrated multi-view
markerless motion capture
joint angle estimation
🔎 Similar Papers
No similar papers found.
J
Juncheng Yu
National Engineering Research Center of Neuromodulation, School of Aerospace Engineering, Tsinghua University, Beijing 100084, China
L
Lusi A
National Engineering Research Center of Neuromodulation, School of Aerospace Engineering, Tsinghua University, Beijing 100084, China
Haoxuan Xie
Haoxuan Xie
Nanyang Technological University
Graph algorithms
W
Weiming Wang
National Engineering Research Center of Neuromodulation, School of Aerospace Engineering, Tsinghua University, Beijing 100084, China