From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction

📅 2025-03-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the clinical challenge of spatial constraints and impracticality of deploying depth sensors in minimally invasive tumor resection, this paper proposes a monocular RGB image–driven method for 3D anatomical reconstruction and intraoperative navigation. By integrating semantic segmentation with an optimized Structure-from-Motion (SfM) pipeline, the approach generates anatomically consistent, high-fidelity 3D segmented point clouds, enabling real-time intraoperative 3D scene understanding. Evaluated for the first time on airway tumor resection, the method achieves reconstruction accuracy comparable to—or exceeding—that of state-of-the-art RGB-D systems, without requiring dedicated depth hardware. It matches or surpasses RGB-D baselines in critical metrics such as postoperative tissue model reconstruction, while significantly reducing system footprint and cost. This work establishes a compact, depth-sensor-free paradigm for monocular surgical autonomous navigation, providing a clinically viable technical pathway toward real-time intraoperative guidance and automated surgical intervention.

Technology Category

Application Category

📝 Abstract
Surgical automation requires precise guidance and understanding of the scene. Current methods in the literature rely on bulky depth cameras to create maps of the anatomy, however this does not translate well to space-limited clinical applications. Monocular cameras are small and allow minimally invasive surgeries in tight spaces but additional processing is required to generate 3D scene understanding. We propose a 3D mapping pipeline that uses only RGB images to create segmented point clouds of the target anatomy. To ensure the most precise reconstruction, we compare different structure from motion algorithms' performance on mapping the central airway obstructions, and test the pipeline on a downstream task of tumor resection. In several metrics, including post-procedure tissue model evaluation, our pipeline performs comparably to RGB-D cameras and, in some cases, even surpasses their performance. These promising results demonstrate that automation guidance can be achieved in minimally invasive procedures with monocular cameras. This study is a step toward the complete autonomy of surgical robots.
Problem

Research questions and friction points this paper is trying to address.

Develops 3D mapping using monocular cameras for tumor resection.
Compares structure from motion algorithms for precise airway mapping.
Demonstrates monocular camera effectiveness in minimally invasive surgeries.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Monocular camera for 3D reconstruction
RGB images to segmented point clouds
Structure from motion algorithms compared
Ayberk Acar
Ayberk Acar
Computer Science Ph.D. Student, Vanderbilt University
Medical ImagingSurgical RoboticsExtended RealityComputer Vision
M
Mariana Smith
Department of Mechanical Engineering, Johns Hopkins University, Baltimore, MD 21211, USA
Lidia Al-Zogbi
Lidia Al-Zogbi
Postdoctoral Scholar, Vanderbilt University
Medical RoboticsImage-Guided InterventionsUltrasoundModeling & Simulation
F
Fangjie Li
Department of Computer Science, Vanderbilt University, Nashville, TN 37235, USA
H
Hao Li
Department of Computer Science, Vanderbilt University, Nashville, TN 37235, USA
Nural Yilmaz
Nural Yilmaz
Postdoctoral Fellow @Johns Hopkins University
RoboticsHapticsSurgical RobotsDeep LearningTeleoperation
P
P. M. Scheikl
Department of Mechanical Engineering, Johns Hopkins University, Baltimore, MD 21211, USA
J
Jesse F. d'Almeida
Department of Mechanical Engineering, Vanderbilt University, Nashville, TN 37235, USA
S
Susheela Sharma
Virtuoso Surgical, Nashville, TN 37205, USA
L
Lauren Branscombe
Virtuoso Surgical, Nashville, TN 37205, USA
T
Tayfun Efe Ertop
Department of Mechanical, Aerospace and Biomedical Engineering, University of Tennessee, Knoxville, TN 37996, USA
R
Robert J. Webster
Department of Mechanical Engineering, Vanderbilt University, Nashville, TN 37235, USA
Ipek Oguz
Ipek Oguz
Vanderbilt University
Medical image computingmedical image analysissegmentationimage registrationrodent imaging
Alan Kuntz
Alan Kuntz
Assistant Professor, Robotics Center and Kahlert School of Computing, University of Utah
RoboticsRobot Motion PlanningMedical RoboticsSurgical RoboticsDesign Optimization
Axel Krieger
Axel Krieger
Associate Professor, Johns Hopkins University
Medical DevicesRobotics
Jie Ying Wu
Jie Ying Wu
Assistant Professor in CS, Vanderbilt University
Medical RoboticsModelling and SimulationMachine LearningTelerobotics