🤖 AI Summary
To address the heavy reliance on scarce 3D annotations and the susceptibility to domain shift in 3D LiDAR point cloud semantic segmentation, this paper proposes a purely 2D-driven unsupervised pseudo-label generation framework. Specifically, LiDAR point clouds are rendered into 2D bird’s-eye-view (BEV) and front-view (FV) projections colored by sensor intensity; a 2D semantic segmentation model pretrained on camera imagery is then applied to generate initial 2D labels. These are back-projected onto the 3D points and merged via cross-view majority voting to yield point-level pseudo-labels. Crucially, the method requires no 3D ground-truth labels and no additional modalities (e.g., RGB images) at inference time, establishing an end-to-end 2D-to-3D pseudo-labeling pipeline. Experiments demonstrate the potential of the generated pseudo-labels for unsupervised domain adaptation, and an ablation study assesses the contribution of each component.
📝 Abstract
Semantic segmentation of 3D LiDAR point clouds, essential for autonomous driving and infrastructure management, is best achieved by supervised learning, which demands extensive annotated datasets and suffers from domain shift. We introduce a new 3D semantic segmentation pipeline that leverages aligned scenes and state-of-the-art 2D segmentation methods, avoiding the need for direct 3D annotation or reliance on additional modalities such as camera images at inference time. Our approach generates 2D views from LiDAR scans colored by sensor intensity and applies 2D semantic segmentation to these views using a camera-domain pretrained model. The segmented 2D outputs are then back-projected onto the 3D points, and a simple voting-based estimator merges the labels associated with each 3D point. Our main contribution is a global pipeline for 3D semantic segmentation that requires no prior 3D annotation and no other modality at inference time, and that can be used for pseudo-label generation. We conduct a thorough ablation study and demonstrate the potential of the generated pseudo-labels for the Unsupervised Domain Adaptation task.
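The cross-view label merge described above can be sketched in a few lines. The following is an illustrative pure-Python sketch, not the authors' implementation: the function name, the `ignore` sentinel for points invisible in a view, and the tie-breaking rule (first label encountered wins) are all assumptions.

```python
from collections import Counter

def majority_vote(per_view_labels, ignore=-1):
    """Merge per-view pseudo-labels for each 3D point by majority vote.

    per_view_labels: list of V lists, each of length N, giving one class
    label per point for each projected view (e.g., BEV and front view).
    `ignore` marks points that were not visible in a given view.
    Returns a list of N merged labels (hypothetical convention).
    """
    n_points = len(per_view_labels[0])
    merged = []
    for i in range(n_points):
        # Collect this point's label from every view where it was visible.
        votes = [view[i] for view in per_view_labels if view[i] != ignore]
        # Keep the most frequent label; leave unlabeled if no view saw it.
        merged.append(Counter(votes).most_common(1)[0][0] if votes else ignore)
    return merged

# Example: three views voting on three points.
labels = majority_vote([[1, 2, 0],
                        [1, 2, 1],
                        [3, 2, -1]])  # point 2 is hidden in the third view
```

In this toy example the first point receives label 1 (two views agree), the second label 2 (unanimous), and the third is decided only by the two views in which it is visible.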