DVD: Discrete Voxel Diffusion for 3D Generation and Editing

📅 2026-05-08

📈 Citations: 0

✨ Influential: 0

career value

185K/year

🤖 AI Summary

Existing 3D generation methods rely on continuous diffusion and post-hoc thresholding for sparse voxel modeling, which hinders efficient editing and uncertainty quantification. This work proposes DVD, a discrete voxel diffusion framework that, for the first time, directly applies discrete diffusion to model 3D sparse voxel priors by treating voxel occupancy as a native discrete variable. DVD introduces predictive entropy as a principled measure of uncertainty and incorporates a lightweight block-wise perturbation fine-tuning strategy to enable single-step, efficient editing. Without incurring additional computational overhead, the method substantially improves generation quality and interpretability while supporting sample-level quality assessment and data filtering.

📝 Abstract

We introduce Discrete Voxel Diffusion (DVD), a discrete diffusion framework to generate, assess, and edit sparse voxels for SLat (Structured LATent) based 3D generative pipelines. Although discrete diffusion has not generally displaced continuous diffusion in image-like generation, we show that it can be an effective first-stage prior for sparse voxel scaffolds. By treating voxel occupancy as a native discrete variable, DVD avoids continuous-to-discrete thresholding and provides a simple framework for voxel generation, uncertainty estimation, and editing. Beyond quality gains, DVD provides more interpretable generation dynamics through explicit categorical modeling. Furthermore, we leverage the predictive entropy as a robust uncertainty metric to identify ambiguous voxel regions and complicated samples, facilitating tasks such as data filtering and quality assessment. Finally, we propose a lightweight fine-tuning strategy using block-structured perturbation patterns. This approach empowers the model to inpaint and edit voxels within a single sampling round, requiring negligible auxiliary computation and no additional model evaluations.

Problem

Research questions and friction points this paper is trying to address.

Discrete Diffusion

3D Generation

Sparse Voxels

Uncertainty Estimation

Voxel Editing

Innovation

Methods, ideas, or system contributions that make the work stand out.

Discrete Voxel Diffusion

Sparse Voxels

Uncertainty Estimation