GaussianZoom: Progressive Zoom-in Generative 3D Gaussian Splatting with Geometric and Semantic Guidance

📅 2026-05-18
📈 Citations: 0
Influential: 0
📄 PDF

career value

222K/year
🤖 AI Summary
This work addresses the challenge of simultaneously preserving fidelity and multi-view consistency in extreme upscaling for 3D scene reconstruction from low-resolution inputs. To this end, the authors propose a progressive generative 3D Gaussian splatting framework that integrates geometrically consistent modeling with multi-scale semantic reasoning. Key components include a multi-view consistent super-resolution module, an extensible continuous level-of-detail structure, depth-guided feature warping, visual-language-model-driven detail synthesis, and a dynamic Gaussian visibility modulation mechanism. Experiments on the Mip-NeRF360 and Tanks & Temples datasets demonstrate that the proposed method significantly outperforms existing approaches in extreme magnification rendering, achieving notable advances in perceptual quality, cross-scale smoothness, and multi-view consistency.
📝 Abstract
We introduce GaussianZoom, a generative zoom-in 3D reconstruction system with an iterative progressive framework that combines geometry-consistent scene modeling and multi-scale semantic reasoning to enable high-fidelity extreme zoom-in rendering from low-resolution inputs. To achieve this, we develop a novel multi-view consistent super-resolution module with depth-based feature warping and VLM-driven detail synthesis, ensuring accurate multi-view correspondence while enriching fine-scale appearance beyond the observed resolution. To support zooming across large magnification ranges, we further introduce a new expandable continuous Level-of-Detail hierarchy that dynamically modulates Gaussian visibility for smooth, alias-free cross-scale rendering. Experiments on Mip-NeRF360 and Tanks\&Temples demonstrate that GaussianZoom achieves superior perceptual quality, multi-view consistency, and robustness under extreme magnification, establishing a strong baseline for generative zoom-in 3D scene reconstruction.
Problem

Research questions and friction points this paper is trying to address.

3D reconstruction
super-resolution
zoom-in
multi-view consistency
generative modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative 3D Gaussian Splatting
Progressive Zoom-in
Multi-view Consistent Super-resolution
Semantic Guidance
Continuous Level-of-Detail
🔎 Similar Papers
No similar papers found.
J
Jiale Shi
State Key Lab of CAD & CG, Zhejiang University
Jiarui Hu
Jiarui Hu
Zhejiang University
Computer Vision Robotics Computer Graphics
Z
Zesong Yang
State Key Lab of CAD & CG, Zhejiang University
K
Kaixuan Luan
State Key Lab of CAD & CG, Zhejiang University
H
Hujun Bao
State Key Lab of CAD & CG, Zhejiang University
Zhaopeng Cui
Zhaopeng Cui
Zhejiang University
Computer VisionRoboticsComputer Graphics