š¤ AI Summary
This work addresses the challenge of generalizing 3D object detection in autonomous driving when sensor configurations differ between training and testing, particularly involving RGB images and LiDAR point clouds. The authors propose LCF3D, a novel framework that integrates late fusion with cascaded fusion strategies: it leverages 2Dā3D detection matching and late fusion to suppress LiDAR false positives, while simultaneously generating 3D frustum proposals from unmatched RGB detections to recover missed objects. Evaluated on the KITTI and nuScenes datasets, LCF3D significantly outperforms LiDAR-only methods, demonstrating superior performance especially on challenging classes such as pedestrians, cyclists, and motorcycles. The approach achieves robust, real-time, and cross-domain generalizable multimodal 3D object detection.