Alignment-Aware and Reliability-Gated Multimodal Fusion for Unmanned Aerial Vehicle Detection Across Heterogeneous Thermal-Visual Sensors

📅 2026-03-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of effectively fusing heterogeneous thermal and visible-spectrum sensors for reliable drone detection, given their disparities in resolution, viewpoint, and field of view. To this end, two multimodal fusion strategies are proposed: RGIF, which leverages ECC registration and guided filtering, and RGMAF, which integrates affine/optical-flow alignment with a reliability-weighted attention mechanism. By incorporating alignment-awareness and reliability gating, the methods adaptively combine the high contrast of thermal imagery with the rich detail of visible data, thereby mitigating poor spatial correspondence and annotation inconsistencies. Evaluated on the MMFW-UAV dataset, RGIF achieves a mAP@50 of 97.65%, while RGMAF attains the highest recall of 98.64%, both significantly outperforming single-modality baselines.

Technology Category

Application Category

📝 Abstract
Reliable unmanned aerial vehicle (UAV) detection is critical for autonomous airspace monitoring but remains challenging when integrating sensor streams that differ substantially in resolution, perspective, and field of view. Conventional fusion methods-such as wavelet-, Laplacian-, and decision-level approaches-often fail to preserve spatial correspondence across modalities and suffer from annotation of inconsistencies, limiting their robustness in real-world settings. This study introduces two fusion strategies, Registration-aware Guided Image Fusion (RGIF) and Reliability-Gated Modality-Attention Fusion (RGMAF), designed to overcome these limitations. RGIF employs Enhanced Correlation Coefficient (ECC)-based affine registration combined with guided filtering to maintain thermal saliency while enhancing structural detail. RGMAF integrates affine and optical-flow registration with a reliability-weighted attention mechanism that adaptively balances thermal contrast and visual sharpness. Experiments were conducted on the Multi-Sensor and Multi-View Fixed-Wing (MMFW)-UAV dataset comprising 147,417 annotated air-to-air frames collected from infrared, wide-angle, and zoom sensors. Among single-modality detectors, YOLOv10x demonstrated the most stable cross-domain performance and was selected as the detection backbone for evaluating fused imagery. RGIF improved the visual baseline by 2.13% mAP@50 (achieving 97.65%), while RGMAF attained the highest recall of 98.64%. These findings show that registration-aware and reliability-adaptive fusion provides a robust framework for integrating heterogeneous modalities, substantially enhancing UAV detection performance in multimodal environments.
Problem

Research questions and friction points this paper is trying to address.

UAV detection
heterogeneous sensors
multimodal fusion
spatial alignment
thermal-visual integration
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal fusion
registration-aware
reliability-gated attention
heterogeneous sensors
UAV detection
🔎 Similar Papers
No similar papers found.
Ishrat Jahan
Ishrat Jahan
East West university
Computer ScienceCybersecurityArtificial IntelligenceData ScienceDeep Learning
M
Molla E Majid
Computer Applications Department, Academic Bridge Program, Qatar Foundation, Doha, Qatar
M Murugappan
M Murugappan
Professor, Kuwait College of Science and Technology
Affective ComputingArtificial IntelligenceBioSignal/Image Processing
M
Muhammad E. H. Chowdhury
Department of Electrical Engineering, Qatar University, Doha 2713, Qatar
N
N. B. Prakash
School of Computing Science and Engineering, Vellore Institute of Technology Bhopal University, Bhopal, India
S
Saad Bin Abul Kashem
Department of Computing Science, AFG College with the University of Aberdeen, Doha, Qatar
B
Balamurugan Balusamy
School of Engineering and IT, Manipal Academy of Higher Education, Dubai Campus, Dubai, UAE
A
Amith Khandakar
Department of Electrical Engineering, Qatar University, Doha 2713, Qatar