neuralCAD-Edit: An Expert Benchmark for Multimodal-Instructed 3D CAD Model Editing

📅 2026-04-17

📈 Citations: 0

✨ Influential: 0

career value

201K/year

🤖 AI Summary

Existing methods struggle to handle multimodal 3D CAD editing requests posed by professional designers in real-world industrial settings. This work introduces the first multimodal instruction benchmark tailored for expert-level CAD editing, constructed by recording videos of designers simultaneously narrating, annotating, and performing edits, thereby capturing authentic tasks involving speech, gestures, sketching, and screen interactions—surpassing the limitations of purely text-based conditioning. Leveraging this dataset, we evaluate the performance gap between state-of-the-art foundation models (GPT-5.2) and human experts: in human acceptability tests, the model lags behind experts by 53% (absolute), underscoring the task’s difficulty and establishing the first realistic, industry-aligned evaluation benchmark for future research.

Technology Category

Application Category

📝 Abstract

We introduce neuralCAD-Edit, the first benchmark for editing 3D CAD models collected from expert CAD engineers. Instead of text conditioning as in prior works, we collect realistic CAD editing requests by capturing videos of professional designers, interacting directly with CAD models in CAD software, while talking, pointing and drawing. We recruited ten consenting designers to contribute to this contained study. We benchmark leading foundation models against human CAD experts carrying out edits, and find a large performance gap in both automatic metrics and human evaluations. Even the best foundation model (GPT 5.2) scores 53% lower (absolute) than CAD experts in human acceptance trials, demonstrating the challenge of neuralCAD-Edit. We hope neuralCAD-Edit will provide a solid foundation against which 3D CAD editing approaches and foundation models can be developed. Code/data: https://autodeskailab.github.io/neuralCAD-Edit

Problem

Research questions and friction points this paper is trying to address.

3D CAD editing

multimodal instruction

expert benchmark

foundation models

human performance gap

Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal instruction

3D CAD editing

expert benchmark