GeoFusion-CAD: Structure-Aware Diffusion with Geometric State Space for Parametric 3D Design

📅 2026-03-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing methods struggle to generate long, parameterized CAD sequences with complex geometric and topological dependencies, while Transformer-based approaches are hindered by quadratic attention costs and limited context length. This work proposes the first end-to-end diffusion framework based on state space models, encoding CAD programs as hierarchical tree structures and modeling them within a joint geometric-topological state space. We introduce a lightweight C-Mamba module to efficiently capture long-range dependencies and develop a structure-aware diffusion mechanism. To support comprehensive evaluation, we release DeepCAD-240, a new benchmark dataset featuring sequences of up to 240 command steps. Experiments demonstrate that our method significantly outperforms existing Transformer models in both short and long sequence generation, achieving state-of-the-art performance in geometric fidelity and topological consistency.

Technology Category

Application Category

📝 Abstract
Parametric Computer-Aided Design (CAD) is fundamental to modern 3D modeling, yet existing methods struggle to generate long command sequences, especially under complex geometric and topological dependencies. Transformer-based architectures dominate CAD sequence generation due to their strong dependency modeling, but their quadratic attention cost and limited context windowing hinder scalability to long programs. We propose GeoFusion-CAD, an end-to-end diffusion framework for scalable and structure-aware generation. Our proposal encodes CAD programs as hierarchical trees, jointly capturing geometry and topology within a state-space diffusion process. Specifically, a lightweight C-Mamba block models long-range structural dependencies through selective state transitions, enabling coherent generation across extended command sequences. To support long-sequence evaluation, we introduce DeepCAD-240, an extended benchmark that increases the sequence length ranging from 40 to 240 while preserving sketch-extrusion semantics from the ABC dataset. Extensive experiments demonstrate that GeoFusion-CAD achieves superior performance on both short and long command ranges, maintaining high geometric fidelity and topological consistency where Transformer-based models degrade. Our approach sets new state-of-the-art scores for long-sequence parametric CAD generation, establishing a scalable foundation for next-generation CAD modeling systems. Code and datasets are available at GitHub.
Problem

Research questions and friction points this paper is trying to address.

parametric CAD
long command sequences
geometric dependencies
topological dependencies
scalability
Innovation

Methods, ideas, or system contributions that make the work stand out.

diffusion model
state space model
parametric CAD
long-sequence generation
C-Mamba
X
Xiaolei Zhou
Zhejiang University of Technology
C
Chuangjie Fang
Zhejiang University of Technology
J
Jie Wu
Hangzhou International Innovation Institute, Beihang University
Jingyi Yang
Jingyi Yang
University of Science and Technology of China
Computer VisionDeep LearningAI AgentGenerative ModelsReinforcement Learning
B
Boyi Lin
Zhejiang University of Technology
Jianwei Zheng
Jianwei Zheng
Computer Science, Zhejiang University of Technology
Machine Learning