🤖 AI Summary
Existing LiDAR point cloud generation methods primarily focus on unconditional synthesis, failing to effectively leverage semantic priors and thus inadequately supporting data augmentation for downstream perception tasks. To address this, we propose SG-LDM—the first semantic-guided LiDAR diffusion model—which enables high-fidelity, controllable generation directly in the native point cloud space via a latent-space-aligned, semantic-conditioned diffusion mechanism. Furthermore, we introduce an end-to-end diffusion-based cross-domain translation framework that jointly integrates semantic conditional generation with domain-adaptive data augmentation. Experiments demonstrate that SG-LDM achieves state-of-the-art performance in generation quality; moreover, our translation framework significantly improves downstream task performance—e.g., LiDAR point cloud segmentation—establishing a novel paradigm for semantic-driven autonomous driving perception systems.
📝 Abstract
Lidar point cloud synthesis based on generative models offers a promising solution to augment deep learning pipelines, particularly when real-world data is scarce or lacks diversity. By enabling flexible object manipulation, this synthesis approach can significantly enrich training datasets and enhance discriminative models. However, existing methods focus on unconditional lidar point cloud generation, overlooking their potential for real-world applications. In this paper, we propose SG-LDM, a Semantic-Guided Lidar Diffusion Model that employs latent alignment to enable robust semantic-to-lidar synthesis. By directly operating in the native lidar space and leveraging explicit semantic conditioning, SG-LDM achieves state-of-the-art performance in generating high-fidelity lidar point clouds guided by semantic labels. Moreover, we propose the first diffusion-based lidar translation framework based on SG-LDM, which enables cross-domain translation as a domain adaptation strategy to enhance downstream perception performance. Systematic experiments demonstrate that SG-LDM significantly outperforms existing lidar diffusion models and the proposed lidar translation framework further improves data augmentation performance in the downstream lidar segmentation task.