Metric, inertially aligned monocular state estimation via kinetodynamic priors

📅 2025-11-25

📈 Citations: 0

✨ Influential: 0

career value

236K/year

🤖 AI Summary

Conventional rigid-body assumptions fail for soft robots due to dynamic elastic deformations, and monocular visual odometry (VO) cannot recover metric scale or gravity direction. Method: We propose a continuous-time state estimation framework integrating physical dynamics priors. Camera trajectory is parameterized via B-splines; elastic deformation is modeled by a multi-layer perceptron (MLP) learning the force–deformation mapping; and visual measurements are explicitly coupled with inertial dynamics through Newton’s second law. The optimization jointly enforces geometric consistency, motion continuity, and physical interpretability. Results: Evaluated on a spring-mounted camera platform, our method significantly improves pose estimation accuracy and robustness. To the best of our knowledge, it is the first monocular VO framework for non-rigid systems that achieves end-to-end perception of true metric scale, gravity orientation, and inertial alignment—without external calibration or prior knowledge of system parameters.

Technology Category

Application Category

📝 Abstract

Accurate state estimation for flexible robotic systems poses significant challenges, particular for platforms with dynamically deforming structures that invalidate rigid-body assumptions. This paper tackles this problem and allows to extend existing rigid-body pose estimation methods to non-rigid systems. Our approach hinges on two core assumptions: first, the elastic properties are captured by an injective deformation-force model, efficiently learned via a Multi-Layer Perceptron; second, we solve the platform's inherently smooth motion using continuous-time B-spline kinematic models. By continuously applying Newton's Second Law, our method establishes a physical link between visually-derived trajectory acceleration and predicted deformation-induced acceleration. We demonstrate that our approach not only enables robust and accurate pose estimation on non-rigid platforms, but that the properly modeled platform physics instigate inertial sensing properties. We demonstrate this feasibility on a simple spring-camera system, and show how it robustly resolves the typically ill-posed problem of metric scale and gravity recovery in monocular visual odometry.

Problem

Research questions and friction points this paper is trying to address.

Extending rigid-body pose estimation to non-rigid robotic systems with deformable structures

Solving metric scale and gravity recovery challenges in monocular visual odometry

Establishing physical links between visual trajectories and deformation-induced accelerations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Learns deformation-force model using Multi-Layer Perceptron

Uses continuous-time B-spline kinematic models

Applies Newton's Second Law for physical acceleration link

🔎 Similar Papers

One Homography is All You Need: IMM-based Joint Homography and Multiple Object State Estimation