PEFT-DML: Parameter-Efficient Fine-Tuning Deep Metric Learning for Robust Multi-Modal 3D Object Detection in Autonomous Driving

📅 2025-11-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the insufficient robustness of multimodal 3D detection in autonomous driving under sensor missingness or unseen modality combinations, this paper proposes a parameter-efficient fine-tuning framework based on deep metric learning. Methodologically, it innovatively integrates LoRA with adapter modules to align and dynamically fuse heterogeneous modalities—including LiDAR, camera, radar, IMU, and GNSS—within a shared latent space, enabling reliable detection from arbitrary subsets of input modalities for the first time. The framework significantly enhances robustness against rapid motion, adverse weather conditions, and cross-domain distribution shifts. Evaluated on the nuScenes benchmark, it achieves state-of-the-art performance in both detection accuracy and cross-scene stability, demonstrating superior generalization capability and practical applicability.

Technology Category

Application Category

📝 Abstract
This study introduces PEFT-DML, a parameter-efficient deep metric learning framework for robust multi-modal 3D object detection in autonomous driving. Unlike conventional models that assume fixed sensor availability, PEFT-DML maps diverse modalities (LiDAR, radar, camera, IMU, GNSS) into a shared latent space, enabling reliable detection even under sensor dropout or unseen modality class combinations. By integrating Low-Rank Adaptation (LoRA) and adapter layers, PEFT-DML achieves significant training efficiency while enhancing robustness to fast motion, weather variability, and domain shifts. Experiments on benchmarks nuScenes demonstrate superior accuracy.
Problem

Research questions and friction points this paper is trying to address.

Robust 3D object detection under sensor dropout
Efficient multi-modal fusion for autonomous driving
Handling unseen modality combinations and domain shifts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Parameter-efficient deep metric learning framework
Maps diverse modalities into shared latent space
Integrates LoRA and adapter layers for efficiency
🔎 Similar Papers
No similar papers found.