MS-YOLO: Infrared Object Detection for Edge Deployment via MobileNetV4 and SlideLoss

📅 2025-09-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses three key challenges in infrared target detection for urban autonomous driving: severe class imbalance, thermal noise interference, and stringent computational constraints on edge devices. To tackle these, we propose a lightweight and efficient detection framework. Methodologically, we replace YOLOv8’s CSPDarknet backbone with MobileNetV4, reducing computational cost by 1.5% while preserving detection accuracy. Additionally, we introduce SlideLoss—a dynamic loss function that adaptively focuses on hard and occluded samples—to jointly optimize precision and recall. Evaluated on the FLIR ADAS V2 dataset, our model achieves competitive mAP—matching state-of-the-art methods—while consuming only 6.7 GFLOPs. The framework significantly improves the trade-off between detection efficiency and robustness under low-light and adverse weather conditions, demonstrating strong suitability for real-time deployment on resource-constrained edge platforms.

Technology Category

Application Category

📝 Abstract
Infrared imaging has emerged as a robust solution for urban object detection under low-light and adverse weather conditions, offering significant advantages over traditional visible-light cameras. However, challenges such as class imbalance, thermal noise, and computational constraints can significantly hinder model performance in practical settings. To address these issues, we evaluate multiple YOLO variants on the FLIR ADAS V2 dataset, ultimately selecting YOLOv8 as our baseline due to its balanced accuracy and efficiency. Building on this foundation, we present exttt{MS-YOLO} ( extbf{M}obileNetv4 and extbf{S}lideLoss based on YOLO), which replaces YOLOv8's CSPDarknet backbone with the more efficient MobileNetV4, reducing computational overhead by extbf{1.5%} while sustaining high accuracy. In addition, we introduce emph{SlideLoss}, a novel loss function that dynamically emphasizes under-represented and occluded samples, boosting precision without sacrificing recall. Experiments on the FLIR ADAS V2 benchmark show that exttt{MS-YOLO} attains competitive mAP and superior precision while operating at only extbf{6.7 GFLOPs}. These results demonstrate that exttt{MS-YOLO} effectively addresses the dual challenge of maintaining high detection quality while minimizing computational costs, making it well-suited for real-time edge deployment in urban environments.
Problem

Research questions and friction points this paper is trying to address.

Enhancing infrared object detection in low-light urban environments
Addressing class imbalance and thermal noise in thermal imaging
Reducing computational costs for real-time edge deployment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Replaces CSPDarknet with efficient MobileNetV4 backbone
Introduces SlideLoss to emphasize underrepresented samples
Achieves high precision with low computational cost
🔎 Similar Papers
No similar papers found.
J
Jiali Zhang
Dept. of Mathematics & Statistics, Missouri University of Science and Technology, Rolla, MO 65409 USA
T
Thomas S. White
Dept. of Computer Science, Missouri University of Science and Technology, Rolla, MO 65409 USA
H
Haoliang Zhang
School of Electrical & Computer Engineering, University of Oklahoma, Norman, OK 73019 USA
Wenqing Hu
Wenqing Hu
Missouri University of Science and Technology (formerly University of Missouri, Rolla)
ProbabilityApplied Mathematics
D
Donald C. Wunsch II
Kummer Institute Center for Artificial Intelligence and Autonomous Systems, Missouri University of Science and Technology, Rolla, MO 65409 USA
J
Jian Liu
Kummer Institute Center for Artificial Intelligence and Autonomous Systems, Missouri University of Science and Technology, Rolla, MO 65409 USA