Geometry OR Tracker: Universal Geometric Operating Room Tracking

📅 2026-02-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of multi-view 3D tracking in operating rooms, where inaccurate camera calibration and RGB-D registration often induce geometric inconsistencies across views, leading to fusion artifacts (“ghosting”) and degraded trajectory accuracy in a shared coordinate system. To mitigate this, the authors propose a two-stage decoupled approach: first, a multi-view metric geometry correction module transforms imprecise calibrations into a globally scale-consistent geometric alignment; second, occlusion-robust 3D point tracking is performed within the unified world coordinate frame. By decoupling geometric consistency correction from tracking—a novel strategy in this domain—the method significantly enhances both fusion stability and tracking precision. Evaluated on the MM-OR benchmark, the correction front-end reduces cross-view depth inconsistency by over 30×, demonstrating the critical role of geometric consistency in improving tracking performance.

Technology Category

Application Category

📝 Abstract
In operating rooms (OR), world-scale multi-view 3D tracking supports downstream applications such as surgeon behavior recognition, where physically meaningful quantities such as distances and motion statistics must be measured in meters. However, real clinical deployments rarely satisfy the geometric prerequisites for stable multi-view fusion and tracking: camera calibration and RGB-D registration are always unreliable, leading to cross-view geometric inconsistency that produces "ghosting" during fusion and degrades 3D trajectories in a shared OR coordinate frame. To address this, we introduce Geometry OR Tracker, a two-stage pipeline that first rectifies imprecise calibration into a scaleconsistent and geometrically consistent camera setup with a single global scale via a Multi-view Metric Geometry Rectification module, and then performs Occlusion-Robust 3D Point Tracking directly in the unified OR world frame. On the MM-OR benchmark, improved geometric consistency translates into tracking gains: our rectification front-end reduces cross-view depth disagreement by more than 30$\times$ compared to raw calibration. Ablation studies further demonstrate the relationship between calibration quality and tracking accuracy, showing that improved geometric consistency yields stronger world-frame tracking.
Problem

Research questions and friction points this paper is trying to address.

multi-view 3D tracking
geometric consistency
camera calibration
RGB-D registration
operating room
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-view Metric Geometry Rectification
Occlusion-Robust 3D Tracking
Geometric Consistency
Operating Room Tracking
World-scale 3D Reconstruction
🔎 Similar Papers
No similar papers found.
Y
Yihua Shao
Centre for Artificial Intelligence and Robotics Hong Kong Institute of Science and Innovation Chinese Academy of Sciences; State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS) Institute of Automation Chinese Academy of Sciences (CASIA)
Kang Chen
Kang Chen
Peking University
Event CameraSpike CameraGreen Infrastructure
F
Feng Xue
University of Trento, Trento, Italy
Siyu Chen
Siyu Chen
USTB, CASIA
Efficient AIModel CompressionAutomatic Driving3D Reconstruction
Long Bai
Long Bai
Research Assistant, Institute of Computing Technology, Chinese Academy of Sciences
Event-Centric AnalysisKnowledge GraphNatural Language Processing
H
Hongyuan Yu
Xiaomi Inc.
Hao Tang
Hao Tang
Peking University
computer vision
Jinlin Wu
Jinlin Wu
Institute of Automation,Chinese Academy of Sciences
Nassir Navab
Nassir Navab
Professor of Computer Science, Technische Universität München