GaussianGrow: Geometry-aware Gaussian Growing from 3D Point Clouds with Text Guidance

📅 2026-04-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of generating geometrically accurate 3D Gaussian representations from scratch in the absence of reliable geometric priors. It introduces, for the first time, a text-guided, progressive growth process that starts from sparse point clouds. The method leverages a multi-view diffusion model to provide appearance supervision and iteratively completes unobserved regions through 2D image inpainting, camera pose optimization, and dynamic viewpoint selection. A geometry-aware strategy is further integrated to enhance structural completeness. Experiments demonstrate that the proposed approach consistently produces high-quality, geometrically coherent, and semantically controllable 3D Gaussian scenes from both synthetic and real-world scanned point clouds, significantly outperforming existing text-to-3D generation methods.
📝 Abstract
3D Gaussian Splatting has demonstrated superior performance in rendering efficiency and quality, yet the generation of 3D Gaussians still remains a challenge without proper geometric priors. Existing methods have explored predicting point maps as geometric references for inferring Gaussian primitives, while the unreliable estimated geometries may lead to poor generations. In this work, we introduce GaussianGrow, a novel approach that generates 3D Gaussians by learning to grow them from easily accessible 3D point clouds, naturally enforcing geometric accuracy in Gaussian generation. Specifically, we design a text-guided Gaussian growing scheme that leverages a multi-view diffusion model to synthesize consistent appearances from input point clouds for supervision. To mitigate artifacts caused by fusing neighboring views, we constrain novel views generated at non-preset camera poses identified in overlapping regions across different views. For completing the hard-to-observe regions, we propose to iteratively detect the camera pose by observing the largest un-grown regions in point clouds and inpainting them by inpainting the rendered view with a pretrained 2D diffusion model. The process continues until complete Gaussians are generated. We extensively evaluate GaussianGrow on text-guided Gaussian generation from synthetic and even real-scanned point clouds. Project Page: https://weiqi-zhang.github.io/GaussianGrow
Problem

Research questions and friction points this paper is trying to address.

3D Gaussian Splatting
geometry-aware generation
text-guided synthesis
3D point clouds
geometric priors
Innovation

Methods, ideas, or system contributions that make the work stand out.

3D Gaussian Splatting
point cloud
text-to-3D
diffusion model
geometry-aware generation
🔎 Similar Papers
No similar papers found.
Weiqi Zhang
Weiqi Zhang
Tsinghua University
3D Computer VisionGenerative Model
Junsheng Zhou
Junsheng Zhou
Tsinghua University
3D computer vision
H
Haotian Geng
School of Software, Tsinghua University, Beijing, China
K
Kanle Shi
Kuaishou Technology, Beijing, China
S
Shenkun Xu
Kuaishou Technology, Beijing, China
Yi Fang
Yi Fang
Associate Professor of NYU Abu Dhabi and NYU Tandon
3D Computer Vision3D Deep Learning3D Meta Learning
Y
Yu-Shen Liu
School of Software, Tsinghua University, Beijing, China