DIJE: Dense Image Jacobian Estimation for Robust Robotic Self-Recognition and Visual Servoing

📅 2025-07-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Robots require real-time, markerless perception of both ego-state and tool configuration—without prior structural knowledge—to achieve robust visual servoing. To address this, we propose Dense Image Jacobian Estimation (DIJE), the first method enabling real-time, pixel-wise estimation of the full image Jacobian matrix. DIJE jointly models optical flow dynamics and employs a simplified Kalman filter to effectively decouple ego-motion from external scene dynamics—even under motion occlusion—enabling complex tasks such as dual-arm tool-tip control. Crucially, DIJE operates solely on monocular video input, requiring no joint encoders or camera-to-robot calibration. Integrated with self-identification and visual servoing controllers, it forms an end-to-end visuomotor policy framework. Evaluated on a physical musculoskeletal robot, DIJE significantly improves markerless self-identification accuracy and visual servoing stability, establishing a scalable, perception-first foundation for general-purpose embodied manipulation.

Technology Category

Application Category

📝 Abstract
For robots to move in the real world, they must first correctly understand the state of its own body and the tools that it holds. In this research, we propose DIJE, an algorithm to estimate the image Jacobian for every pixel. It is based on an optical flow calculation and a simplified Kalman Filter that can be efficiently run on the whole image in real time. It does not rely on markers nor knowledge of the robotic structure. We use the DIJE in a self-recognition process which can robustly distinguish between movement by the robot and by external entities, even when the motion overlaps. We also propose a visual servoing controller based on DIJE, which can learn to control the robot's body to conduct reaching movements or bimanual tool-tip control. The proposed algorithms were implemented on a physical musculoskeletal robot and its performance was verified. We believe that such global estimation of the visuomotor policy has the potential to be extended into a more general framework for manipulation.
Problem

Research questions and friction points this paper is trying to address.

Estimates dense image Jacobian for robotic self-recognition
Enables robust distinction between robot and external motion
Develops visual servoing controller for reaching and tool control
Innovation

Methods, ideas, or system contributions that make the work stand out.

Estimates image Jacobian for every pixel
Uses optical flow and simplified Kalman Filter
Enables self-recognition and visual servoing
🔎 Similar Papers
No similar papers found.