EndoWave: Rational-Wavelet 4D Gaussian Splatting for Endoscopic Reconstruction

πŸ“… 2025-10-27
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
In robot-assisted minimally invasive surgery, 3D reconstruction from endoscopic video faces challenges including photometric inconsistency, non-rigid tissue motion, and strong specular reflections. To address these, this paper proposes a spatiotemporally unified 4D Gaussian splatting framework. Methodologically, it introduces optical-flow-guided geometric constraints to enforce temporal consistency and designs a multi-resolution supervision scheme based on rational orthogonal wavelets to jointly optimize appearance and geometry. The approach integrates 4D Gaussian representations, differentiable rendering, optical flow estimation, and wavelet transforms. Evaluated on the real-world EndoNeRF and StereoMIS surgical datasets, our method achieves significant improvements in geometric accuracy and visual fidelity over prior work. Quantitative and qualitative results demonstrate state-of-the-art performance, validating its effectiveness for modeling dynamic endoscopic scenes.

Technology Category

Application Category

πŸ“ Abstract
In robot-assisted minimally invasive surgery, accurate 3D reconstruction from endoscopic video is vital for downstream tasks and improved outcomes. However, endoscopic scenarios present unique challenges, including photometric inconsistencies, non-rigid tissue motion, and view-dependent highlights. Most 3DGS-based methods that rely solely on appearance constraints for optimizing 3DGS are often insufficient in this context, as these dynamic visual artifacts can mislead the optimization process and lead to inaccurate reconstructions. To address these limitations, we present EndoWave, a unified spatio-temporal Gaussian Splatting framework by incorporating an optical flow-based geometric constraint and a multi-resolution rational wavelet supervision. First, we adopt a unified spatio-temporal Gaussian representation that directly optimizes primitives in a 4D domain. Second, we propose a geometric constraint derived from optical flow to enhance temporal coherence and effectively constrain the 3D structure of the scene. Third, we propose a multi-resolution rational orthogonal wavelet as a constraint, which can effectively separate the details of the endoscope and enhance the rendering performance. Extensive evaluations on two real surgical datasets, EndoNeRF and StereoMIS, demonstrate that our method EndoWave achieves state-of-the-art reconstruction quality and visual accuracy compared to the baseline method.
Problem

Research questions and friction points this paper is trying to address.

Addresses photometric inconsistencies and non-rigid motion in endoscopic reconstruction
Enhances temporal coherence using optical flow-based geometric constraints
Improves rendering accuracy through multi-resolution rational wavelet supervision
Innovation

Methods, ideas, or system contributions that make the work stand out.

4D Gaussian Splatting with unified spatio-temporal representation
Optical flow geometric constraint for temporal coherence
Multi-resolution rational wavelet supervision for detail enhancement
πŸ”Ž Similar Papers
No similar papers found.
T
Taoyu Wu
Xi’an Jiaotong-Liverpool University, China
Y
Yiyi Miao
Xi’an Jiaotong-Liverpool University, China
J
Jiaxin Guo
The Chinese University of Hong Kong, Hong Kong, China
Ziyan Chen
Ziyan Chen
Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences
Generative AILow Level Vision
S
Sihang Zhao
Xi’an Jiaotong-Liverpool University, China
Z
Zhuoxiao Li
The Hong Kong University of Science and Technology (Guangzhou), China
Zhe Tang
Zhe Tang
University of Liverpool
WSNIoThybrid network
Baoru Huang
Baoru Huang
University of Liverpool; Imperial College London
RoboticsComputer visionSurgical visionImage-Guided Intervention
Limin Yu
Limin Yu
Xi'an Jiaotong-Liverpool University
sonar detectionrational waveletsmedical image analysisAGV system design