SuperQuadricOcc: Multi-Layer Gaussian Approximation of Superquadrics for Real-Time Self-Supervised Occupancy Estimation

📅 2025-11-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In autonomous driving semantic occupancy estimation, Gaussian representations suffer from high memory consumption and slow inference, while superquadrics—despite their compactness—are hindered by the lack of differentiable rasterizers, preventing self-supervised training. To address this, this work pioneers the integration of superquadrics into self-supervised occupancy modeling. We propose a multi-level icosahedral subdivision scheme to approximate superquadrics with differentiable Gaussians, enabling end-to-end optimization via differentiable rendering. Coupled with a lightweight voxelization module and a self-supervised training framework, our approach significantly reduces representational complexity. On the Occ3D benchmark, it achieves an 84% reduction in primitive count, 75% memory compression, 124% inference speedup, and a 5.9% improvement in mIoU—outperforming all existing methods across all metrics.

Technology Category

Application Category

📝 Abstract
Semantic occupancy estimation enables comprehensive scene understanding for automated driving, providing dense spatial and semantic information essential for perception and planning. While Gaussian representations have been widely adopted in self-supervised occupancy estimation, the deployment of a large number of Gaussian primitives drastically increases memory requirements and is not suitable for real-time inference. In contrast, superquadrics permit reduced primitive count and lower memory requirements due to their diverse shape set. However, implementation into a self-supervised occupancy model is nontrivial due to the absence of a superquadric rasterizer to enable model supervision. Our proposed method, SuperQuadricOcc, employs a superquadric-based scene representation. By leveraging a multi-layer icosphere-tessellated Gaussian approximation of superquadrics, we enable Gaussian rasterization for supervision during training. On the Occ3D dataset, SuperQuadricOcc achieves a 75% reduction in memory footprint, 124% faster inference, and a 5.9% improvement in mIoU compared to previous Gaussian-based methods, without the use of temporal labels. To our knowledge, this is the first occupancy model to enable real-time inference while maintaining competitive performance. The use of superquadrics reduces the number of primitives required for scene modeling by 84% relative to Gaussian-based approaches. Finally, evaluation against prior methods is facilitated by our fast superquadric voxelization module. The code will be released as open source.
Problem

Research questions and friction points this paper is trying to address.

Addresses high memory usage in self-supervised occupancy estimation methods
Enables real-time inference by reducing primitive count with superquadrics
Solves lack of superquadric rasterizer for model supervision in training
Innovation

Methods, ideas, or system contributions that make the work stand out.

Superquadrics replace Gaussians to reduce primitive count
Multi-layer icosphere tessellation approximates superquadrics for rasterization
Fast superquadric voxelization enables real-time occupancy inference
🔎 Similar Papers
No similar papers found.
S
Seamie Hayes
Department of Electronic and Computer Engineering, the Research Ireland Centre for Research Training in Foundations in Data Science, and the Data Driven Computer Engineering (D²iCE) Research Centre, University of Limerick, Limerick, V94 T9PX Ireland
R
Reenu Mohandas
Department of Electronic and Computer Engineering, and the Data Driven Computer Engineering (D²iCE) Research Centre, University of Limerick, Limerick, V94 T9PX, Ireland
Tim Brophy
Tim Brophy
University of Galway
Alexandre Boulch
Alexandre Boulch
Senior researcher at valeo.ai
Computer sciencecomputational geometrycomputer vision
Ganesh Sistu
Ganesh Sistu
Principal Artificial Intelligence Architect, Valeo Ireland
Autonomous DrivingMachine LearningComputer VisionDeep Learning
C
Ciaran Eising
Department of Electronic and Computer Engineering, the Research Ireland Centre for Research Training in Foundations in Data Science, and the Data Driven Computer Engineering (D²iCE) Research Centre, all hosted in the University of Limerick, Limerick, V94 T9PX Ireland