๐ค AI Summary
Existing 3D generative methods predominantly produce monolithic shapes, limiting interactive editing capabilities. To address this, we propose a two-stage collaborative framework for part-level controllable 3D asset generation. In the first stage, unlabeled 3D part layout planning is guided by 2D part masks to ensure structural consistency. In the second stage, an autoregressive structural planning module generates a sequence of part bounding boxes, while a pretrained spatially conditioned correction flow model jointly synthesizes geometry and texture. Our method supports user-defined part granularity and precise spatial placement, enabling unified modeling of semantic disentanglement and structural coherence. Extensive evaluations on multiple benchmarks demonstrate state-of-the-art performance in both fidelity and diversity, with significant improvements in editability and downstream applicabilityโe.g., for scene composition and iterative design.
๐ Abstract
The creation of 3D assets with explicit, editable part structures is crucial for advancing interactive applications, yet most generative methods produce only monolithic shapes, limiting their utility. We introduce OmniPart, a novel framework for part-aware 3D object generation designed to achieve high semantic decoupling among components while maintaining robust structural cohesion. OmniPart uniquely decouples this complex task into two synergistic stages: (1) an autoregressive structure planning module generates a controllable, variable-length sequence of 3D part bounding boxes, critically guided by flexible 2D part masks that allow for intuitive control over part decomposition without requiring direct correspondences or semantic labels; and (2) a spatially-conditioned rectified flow model, efficiently adapted from a pre-trained holistic 3D generator, synthesizes all 3D parts simultaneously and consistently within the planned layout. Our approach supports user-defined part granularity, precise localization, and enables diverse downstream applications. Extensive experiments demonstrate that OmniPart achieves state-of-the-art performance, paving the way for more interpretable, editable, and versatile 3D content.