🤖 AI Summary
To address the challenge of timely detection of subtle vehicle damages—particularly undercarriage dents and scratches—in shared and rental fleets, this paper proposes a real-time anomaly detection method based on a multimodal autoencoder. Unlike vision-centric approaches, our method innovatively fuses onboard IMU vibration signals and microphone-acquired acoustic signals, employing a joint feature-level fusion and decision-level ensemble pooling architecture for end-to-end learning. This design enables low-resource, real-time deployment without expensive cameras. Evaluated on real-world fleet data, the model achieves 92% ROC-AUC, substantially outperforming unimodal baselines and state-of-the-art methods, demonstrating high sensitivity and robustness to minute structural anomalies. The framework is readily extensible to broader automotive safety applications, including collision perception and airbag trigger prediction.
📝 Abstract
Wear and tear detection in fleet and shared vehicle systems is a critical challenge, particularly in rental and car-sharing services, where minor damage, such as dents, scratches, and underbody impacts, often goes unnoticed or is detected too late. Currently, manual inspection methods are the default approach but are labour intensive and prone to human error. In contrast, state-of-the-art image-based methods struggle with real-time performance and are less effective at detecting underbody damage due to limited visual access and poor spatial coverage. This work introduces a novel multi-modal architecture based on anomaly detection to address these issues. Sensors such as IMUs and microphones are integrated into a compact device mounted on the vehicle's windshield. This approach supports real-time damage detection while avoiding the need for highly resource-intensive sensors. We developed multiple variants of multi-modal autoencoder-based architectures and evaluated them against unimodal and state-of-the-art methods. Our ensemble pooling multi-modal model achieved the highest performance, with a Receiver Operating Characteristic-Area Under Curve (ROC-AUC) of 92%, demonstrating its effectiveness in real-world applications. This approach can also be extended to other applications, such as improving automotive safety - where it can integrate with airbag systems for efficient deployment - and helping autonomous vehicles by complementing other sensors in collision detection.