A Camera-Cooperative ISAC Framework for Multimodal Non-Cooperative UAVs Sensing

📅 2026-05-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

194K/year
🤖 AI Summary
This work addresses the limitations of single-modality sensing and the competition for integrated sensing and communication (ISAC) resources in non-cooperative drone detection by proposing a multimodal perception framework that synergistically combines camera and ISAC systems. The framework establishes a complementary perception loop wherein coarse-grained visual monitoring guides high-precision ISAC beam tracking. It introduces a novel vision-to-echo domain alignment mechanism (V2EDA), a cross-attention fusion architecture, and a multimodal state estimation algorithm (MMFE) to effectively mitigate resource contention and enhance sensing efficiency. Experimental results on the DeepSense 6G dataset demonstrate that the proposed approach reduces average beam steering overhead by 71% and decreases tracking overhead by 1.69%–11.15%, while maintaining high angular estimation accuracy.
📝 Abstract
The detection of non-cooperative unmanned aerial vehicles (UAVs) presents significant challenges for Integrated Sensing and Communication (ISAC) systems due to the inherent limitations of single-modal perception and the competition for shared communication and sensing resources. To address these challenges, this paper proposes a novel Camera-Cooperative ISAC (CC-ISAC) framework that employs multimodal sensing to enable efficient UAV beam steering and tracking. The proposed framework employs cameras for coarse-grained airspace monitoring and utilizes ISAC for fine-grained, high-precision sensing, forming a complementary perception loop that enhances both sensing accuracy and resource efficiency. Within this framework, two key modules are developed: (1) a Vision-to-Echo Data Alignment (V2EDA) model that aligns visual and echo-domain features through cross-attention mechanisms, and (2) a Multimodal Fusion-Based Estimation (MMFE) model that integrates historical multimodal data with current observations for robust state estimation. Extensive evaluations conducted on the DeepSense 6G dataset demonstrate that the proposed framework achieves an average reduction of 71% in beam steering overhead and 1.69-11.15% in tracking overhead while maintaining high angular estimation accuracy. The CC-ISAC framework effectively mitigates resource contention between sensing and communication, enabling reliable UAV surveillance while freeing substantial system resources for additional communication tasks, thereby representing a practical advancement in ISAC system design.
Problem

Research questions and friction points this paper is trying to address.

non-cooperative UAVs
Integrated Sensing and Communication
multimodal sensing
resource contention
beam steering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Camera-Cooperative ISAC
Multimodal Sensing
Vision-to-Echo Data Alignment
Multimodal Fusion
Non-Cooperative UAV Detection
🔎 Similar Papers
No similar papers found.
W
Wenfeng Wu
State Key Laboratory of Novel Software Technology, Nanjing University, Nanjing, 210008, China; Institute of Intelligent Networks and Communications (NINE) and School of Intelligent Software and Engineering, Nanjing University (Suzhou Campus), Suzhou, 215163, China
Luping Xiang
Luping Xiang
Research professor @ Nanjing University
wireless communication
K
Kun Yang
State Key Laboratory of Novel Software Technology, Nanjing University, Nanjing, 210008, China; Institute of Intelligent Networks and Communications (NINE) and School of Intelligent Software and Engineering, Nanjing University (Suzhou Campus), Suzhou, 215163, China