TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models

📅 2025-05-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing methods typically generate terrain heightmaps and textures in isolation, neglecting their intrinsic geometric-appearance coupling, thereby limiting visual realism. To address this, we propose the first unsupervised framework for joint heightmap and texture generation, explicitly modeling geometric-appearance coherency. Our approach builds upon a latent diffusion model (LDM), pre-trained on unlabeled paired data to enforce cross-modal consistency without supervision, and incorporates a lightweight external adapter to enable sketch-guided controllable synthesis. Crucially, the framework requires no paired annotations, yet achieves semantically coherent, high-fidelity, and diverse terrain synthesis. Extensive qualitative and quantitative evaluations demonstrate significant improvements over unimodal baselines across multiple metrics. The method shows strong practical utility for high-fidelity terrain modeling in games and film production.

Technology Category

Application Category

📝 Abstract
3D terrain models are essential in fields such as video game development and film production. Since surface color often correlates with terrain geometry, capturing this relationship is crucial to achieving realism. However, most existing methods generate either a heightmap or a texture, without sufficiently accounting for the inherent correlation. In this paper, we propose a method that jointly generates terrain heightmaps and textures using a latent diffusion model. First, we train the model in an unsupervised manner to randomly generate paired heightmaps and textures. Then, we perform supervised learning of an external adapter to enable user control via hand-drawn sketches. Experiments show that our approach allows intuitive terrain generation while preserving the correlation between heightmaps and textures.
Problem

Research questions and friction points this paper is trying to address.

Joint generation of terrain heightmaps and textures
Capturing correlation between geometry and surface color
Enabling user control via hand-drawn sketches
Innovation

Methods, ideas, or system contributions that make the work stand out.

Jointly generates terrain heightmaps and textures
Uses latent diffusion model for unsupervised training
Enables user control via hand-drawn sketches
🔎 Similar Papers
2024-09-12arXiv.orgCitations: 9
K
Kazuki Higo
University of Tsukuba
T
Toshiki Kanai
University of Tsukuba
Y
Yuki Endo
University of Tsukuba
Yoshihiro Kanamori
Yoshihiro Kanamori
University of Tsukuba
Computer GraphicsComputer VisionImage EditingRealtime RenderingNPR