LoG3D: Ultra-High-Resolution 3D Shape Modeling via Local-to-Global Partitioning

📅 2025-11-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing high-fidelity 3D generation methods face challenges in modeling arbitrary topologies (e.g., open surfaces, non-manifold structures), require watertight preprocessing for signed distance fields (SDFs), and suffer from sampling artifacts in point-cloud representations. To address these issues, this paper proposes a local-to-global (LoG) generative architecture based on unsigned distance fields (UDFs). Our key contributions are: (1) a UBlock tiling mechanism with Pad-Average strategy enabling stable modeling at ultra-high resolution (2048³); (2) hybrid geometric modeling combining 3D convolutions for local geometric detail capture and sparse Transformers for global structural coherence; and (3) an end-to-end variational autoencoder training framework. Experiments demonstrate state-of-the-art performance in reconstruction accuracy, surface smoothness, and topological flexibility—achieving, for the first time, high-resolution, high-quality unified modeling of complex non-manifold and open-surface geometries.

Technology Category

Application Category

📝 Abstract
Generating high-fidelity 3D contents remains a fundamental challenge due to the complexity of representing arbitrary topologies-such as open surfaces and intricate internal structures-while preserving geometric details. Prevailing methods based on signed distance fields (SDFs) are hampered by costly watertight preprocessing and struggle with non-manifold geometries, while point-cloud representations often suffer from sampling artifacts and surface discontinuities. To overcome these limitations, we propose a novel 3D variational autoencoder (VAE) framework built upon unsigned distance fields (UDFs)-a more robust and computationally efficient representation that naturally handles complex and incomplete shapes. Our core innovation is a local-to-global (LoG) architecture that processes the UDF by partitioning it into uniform subvolumes, termed UBlocks. This architecture couples 3D convolutions for capturing local detail with sparse transformers for enforcing global coherence. A Pad-Average strategy further ensures smooth transitions at subvolume boundaries during reconstruction. This modular design enables seamless scaling to ultra-high resolutions up to 2048^3-a regime previously unattainable for 3D VAEs. Experiments demonstrate state-of-the-art performance in both reconstruction accuracy and generative quality, yielding superior surface smoothness and geometric flexibility.
Problem

Research questions and friction points this paper is trying to address.

Handling non-manifold geometries and open surfaces in 3D modeling
Overcoming limitations of signed distance fields and point clouds
Achieving ultra-high-resolution 3D shape reconstruction and generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses unsigned distance fields for 3D representation
Local-to-global architecture with UBlocks partitioning
Combines 3D convolutions with sparse transformers
🔎 Similar Papers
No similar papers found.
X
Xinran Yang
Nanjing University
S
Shuichang Lai
Alibaba Group
Jiangjing Lyu
Jiangjing Lyu
Alibaba
Computer VisionComputer Graphics
Hongjie Li
Hongjie Li
Peking University
Computer Graphics
B
Bowen Pan
Alibaba Group
Y
Yuanqi Li
Nanjing University
J
Jie Guo
Nanjing University
Z
Zhengkang Zhou
Nanjing Urban Construction Tunnel&Bridge Intelligent Management
Y
Yanwen Guo
Nanjing University