π€ AI Summary
To address out-of-distribution (OOD) object misclassification in LiDAR point cloud semantic segmentation, this paper proposes an unsupervised OOD detection method requiring no auxiliary data. The method operates solely on a single forward pass of a pre-trained segmentation network, without model ensembling or multi-stage training. Its core contribution is the first construction of a hierarchical Gaussian Mixture Model (GMM) in deep feature space, coupled with Bayesian inference to explicitly disentangle epistemic uncertainty (model ignorance) from aleatoric uncertainty (data randomness)βthereby overcoming the fundamental limitation of entropy-based methods that conflate these two uncertainty sources. Evaluated on SemanticKITTI, the approach achieves substantial improvements: +18% AUROC, +22% AUPRC, and a reduction in FPR95 from 76% to 40%, significantly outperforming state-of-the-art prediction-entropy baselines.
π Abstract
In addition to accurate scene understanding through precise semantic segmentation of LiDAR point clouds, detecting out-of-distribution (OOD) objects, instances not encountered during training, is essential to prevent the incorrect assignment of unknown objects to known classes. While supervised OOD detection methods depend on auxiliary OOD datasets, unsupervised methods avoid this requirement but typically rely on predictive entropy, the entropy of the predictive distribution obtained by averaging over an ensemble or multiple posterior weight samples. However, these methods often conflate epistemic (model) and aleatoric (data) uncertainties, misclassifying ambiguous in distribution regions as OOD. To address this issue, we present an unsupervised OOD detection approach that employs epistemic uncertainty derived from hierarchical Bayesian modeling of Gaussian Mixture Model (GMM) parameters in the feature space of a deep neural network. Without requiring auxiliary data or additional training stages, our approach outperforms existing uncertainty-based methods on the SemanticKITTI dataset, achieving an 18% improvement in AUROC, 22% increase in AUPRC, and 36% reduction in FPR95 (from 76% to 40%), compared to the predictive entropy approach used in prior works.