Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation

📅 2025-01-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing automatic mesh generation methods struggle to simultaneously ensure surface continuity, structural fidelity, and scalability during the intermediate representation stage, often resulting in mesh distortion, suboptimal vertex density distribution, and limited capability for large-scale object processing. This paper proposes an end-to-end framework for high-quality triangular mesh generation. We introduce a novel local-aware tokenization algorithm that preserves face adjacency relationships to guarantee topological consistency; design a dual-stream point conditioner that jointly models global semantics and local geometry; and integrate a locality-aware autoencoder with multi-scale geometric guidance. Our method supports high-fidelity mesh synthesis with up to 5,000 faces, achieving significant improvements over state-of-the-art approaches in structural integrity, surface smoothness, and scalability. To our knowledge, this is the first method enabling artist-level quality, large-scale triangular mesh generation in a fully end-to-end manner.

Technology Category

Application Category

📝 Abstract
Triangle meshes are fundamental to 3D applications, enabling efficient modification and rasterization while maintaining compatibility with standard rendering pipelines. However, current automatic mesh generation methods typically rely on intermediate representations that lack the continuous surface quality inherent to meshes. Converting these representations into meshes produces dense, suboptimal outputs. Although recent autoregressive approaches demonstrate promise in directly modeling mesh vertices and faces, they are constrained by the limitation in face count, scalability, and structural fidelity. To address these challenges, we propose Nautilus, a locality-aware autoencoder for artist-like mesh generation that leverages the local properties of manifold meshes to achieve structural fidelity and efficient representation. Our approach introduces a novel tokenization algorithm that preserves face proximity relationships and compresses sequence length through locally shared vertices and edges, enabling the generation of meshes with an unprecedented scale of up to 5,000 faces. Furthermore, we develop a Dual-stream Point Conditioner that provides multi-scale geometric guidance, ensuring global consistency and local structural fidelity by capturing fine-grained geometric features. Extensive experiments demonstrate that Nautilus significantly outperforms state-of-the-art methods in both fidelity and scalability.
Problem

Research questions and friction points this paper is trying to address.

Smooth Surface Maintenance
Optimal Mesh Density
Large-scale Object Processing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Nautilus Autoencoder
Dual-Stream Point Conditioner
Highly Detailed Mesh Generation
🔎 Similar Papers
No similar papers found.
Y
Yuxuan Wang
Nanyang Technological University, Singapore
Xuanyu Yi
Xuanyu Yi
ByteDance Seed
3D VisionGenerative Model
Haohan Weng
Haohan Weng
South China University of Technology
generative modelscomputer vision
Qingshan Xu
Qingshan Xu
Nanyang Technological University
Computer Vision3D Reconstruction
X
Xiaokang Wei
The Hong Kong Polytechnic University, Hong Kong, China
X
Xianghui Yang
Tencent Hunyuan, China
C
Chunchao Guo
Tencent Hunyuan, China
L
Long Chen
Hong Kong University of Science and Technology, Hong Kong, China
H
Hanwang Zhang
Nanyang Technological University, Singapore