Knowledge-Informed Neural Network for Complex-Valued SAR Image Recognition

📅 2025-10-23
📈 Citations: 0
Influential: 0
🤖 AI Summary
This paper targets the “representation trilemma” in complex-valued SAR image recognition: under data scarcity and domain shift, generalization, interpretability, and computational efficiency are hard to achieve at the same time. It proposes the Knowledge-Informed Neural Network (KINN), a lightweight framework built on a “compression-aggregation-compression” architecture. In the first stage, a dictionary processor embeds electromagnetic scattering priors so that a compact unfolding network can extract sparse, physically grounded features. An aggregation module then enriches these features, and a compact classification head trained with self-distillation performs the final semantic compression. KINN is instantiated in a CNN variant (0.7M parameters) and a Vision Transformer variant (0.95M parameters). On five SAR benchmarks it reports state-of-the-art parameter-efficient recognition, strong generalization in few-shot and out-of-distribution settings, and physically interpretable features, with a parameter budget small enough to make edge deployment feasible.

📝 Abstract
Deep learning models for complex-valued Synthetic Aperture Radar (CV-SAR) image recognition are fundamentally constrained by a representation trilemma under data-limited and domain-shift scenarios: the concurrent, yet conflicting, optimization of generalization, interpretability, and efficiency. Our work is motivated by the premise that the rich electromagnetic scattering features inherent in CV-SAR data hold the key to resolving this trilemma, yet they are insufficiently harnessed by conventional data-driven models. To this end, we introduce the Knowledge-Informed Neural Network (KINN), a lightweight framework built upon a novel "compression-aggregation-compression" architecture. The first stage performs a physics-guided compression, wherein a novel dictionary processor adaptively embeds physical priors, enabling a compact unfolding network to efficiently extract sparse, physically-grounded signatures. A subsequent aggregation module enriches these representations, followed by a final semantic compression stage that utilizes a compact classification head with self-distillation to learn maximally task-relevant and discriminative embeddings. We instantiate KINN in both CNN (0.7M) and Vision Transformer (0.95M) variants. Extensive evaluations on five SAR benchmarks confirm that KINN establishes a state-of-the-art in parameter-efficient recognition, offering exceptional generalization in data-scarce and out-of-distribution scenarios and tangible interpretability, thereby providing an effective solution to the representation trilemma and offering a new path for trustworthy AI in SAR image analysis.
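
The abstract does not spell out the unfolding network, so the following is only a minimal sketch of what algorithm unfolding generally looks like for complex-valued sparse coding. It assumes plain ISTA as the underlying iteration and a random dictionary `D`; KINN's actual dictionary processor, layer design, and learned parameters are not given here. The names `complex_soft_threshold` and `unfolded_ista` are hypothetical.

```python
import numpy as np

def complex_soft_threshold(z, theta):
    """Shrink the magnitude of each complex entry by theta, keeping its phase."""
    mag = np.abs(z)
    scale = np.maximum(mag - theta, 0.0) / np.maximum(mag, 1e-12)
    return z * scale

def unfolded_ista(y, D, n_layers=50, theta=0.05):
    """Run a fixed number of ISTA iterations ("layers") for y ~ D @ x with sparse x.

    Assumption: in a learned unfolding network the step size and threshold of
    each layer would be trainable; here they are fixed constants.
    """
    step = 1.0 / np.linalg.norm(D, 2) ** 2  # 1 / Lipschitz constant of the gradient
    x = np.zeros(D.shape[1], dtype=complex)
    for _ in range(n_layers):
        residual = y - D @ x
        # Gradient step on the data term, then proximal step on the L1 term.
        x = complex_soft_threshold(x + step * (D.conj().T @ residual), step * theta)
    return x

# Toy data: a random complex dictionary with unit-norm atoms (not a physical prior).
rng = np.random.default_rng(0)
D = rng.standard_normal((32, 64)) + 1j * rng.standard_normal((32, 64))
D /= np.linalg.norm(D, axis=0, keepdims=True)
x_true = np.zeros(64, dtype=complex)
x_true[[3, 17, 40]] = [1.0 + 0.5j, -0.8j, 0.6]
y = D @ x_true

x_hat = unfolded_ista(y, D)
print(np.argsort(np.abs(x_hat))[-3:])  # indices of the three largest coefficients
```

The thresholding acts on the magnitude only, so the phase of each coefficient is preserved, which is the property that matters when the input is complex-valued SAR data rather than an amplitude image.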
Problem

Research questions and friction points this paper is trying to address.

SAR image recognition faces a trilemma among generalization, interpretability, and efficiency
Conventional data-driven models underuse the electromagnetic scattering features in CV-SAR data
Parameter-efficient recognition is needed in data-limited and domain-shift scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Physics-guided compression embeds electromagnetic scattering priors
Aggregation module enriches sparse physically-grounded signatures
Self-distillation classification head learns discriminative embeddings
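
The page does not describe how the self-distillation head is trained. As a rough illustration only, the sketch below assumes one common formulation: the student logits are trained with cross-entropy on the labels plus a temperature-softened KL term toward teacher logits produced by another branch or snapshot of the same network. The function name `self_distillation_loss` and the defaults `temperature=2.0` and `alpha=0.5` are hypothetical, not values from the paper.

```python
import numpy as np

def log_softmax(logits, axis=-1):
    """Numerically stable log-softmax."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))

def self_distillation_loss(student_logits, teacher_logits, labels,
                           temperature=2.0, alpha=0.5):
    """Cross-entropy on labels plus KL(teacher || student) on softened outputs.

    Assumption: teacher_logits come from the same model (e.g. a deeper branch
    or an earlier snapshot) and are treated as constants.
    """
    n = student_logits.shape[0]
    ce = -log_softmax(student_logits)[np.arange(n), labels].mean()
    log_p_teacher = log_softmax(teacher_logits / temperature)
    log_p_student = log_softmax(student_logits / temperature)
    kl = (np.exp(log_p_teacher) * (log_p_teacher - log_p_student)).sum(axis=1).mean()
    # The temperature**2 factor keeps the KL term's gradient scale comparable to CE.
    return (1.0 - alpha) * ce + alpha * temperature ** 2 * kl

student = np.array([[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]])
teacher = np.array([[3.0, 0.2, -2.0], [0.0, 2.5, 0.1]])
labels = np.array([0, 1])
print(self_distillation_loss(student, teacher, labels))
```

When the teacher and student logits coincide, the KL term vanishes and the loss reduces to the weighted cross-entropy, which is a quick sanity check for any implementation of this kind.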
Haodong Yang
School of Automation, Northwestern Polytechnical University, Xi’an, China
Zhongling Huang
School of Automation, Northwestern Polytechnical University, Xi’an, China; Shenzhen Research Institute of Northwestern Polytechnical University, Shenzhen, China
Shaojie Guo
School of Automation, Northwestern Polytechnical University, Xi’an, China
Zhe Zhang
Aerospace Information Technology University, Jinan, China; the Suzhou Aerospace Information Research Institute, Suzhou, China; the National Key Laboratory of Microwave Imaging, Beijing, China; the Aerospace Information Research Institute, CAS, Beijing, China; the School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing, China
Gong Cheng
Professor, Nanjing University
big data search, knowledge graph, LLM inference
Junwei Han
School of Automation, Northwestern Polytechnical University, Xi’an, China; School of Artificial Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, China