Difference Decomposition Networks for Infrared Small Target Detection

📅 2025-12-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Infrared small target detection (ISTD) faces two major challenges: severe lack of target texture and heavy background clutter. To address these, this paper proposes a basis-decomposition-based feature disentanglement framework, which systematically introduces a novel difference decomposition mechanism. Specifically, we design four components: a scalable Basis Decomposition Module (BDM), a Spatial Difference Decomposition Module (SD²M), a Spatial Difference Decomposition Downsampling Module (SD³M), and a Temporal Difference Decomposition Module (TD²M), enabling simultaneous target feature enhancement and background interference suppression. Our method integrates an improved U-shaped architecture with explicit motion information modeling, supporting both single-frame and multi-frame detection. Evaluated on the SISTD and MISTD benchmarks, our approach achieves state-of-the-art performance: STD²Net attains 87.68% mIoU on the multi-frame task—substantially outperforming the single-frame SD²Net (64.97%)—demonstrating the effectiveness and generalizability of difference decomposition for feature disentanglement.

Technology Category

Application Category

📝 Abstract
Infrared small target detection (ISTD) faces two major challenges: a lack of discernible target texture and severe background clutter, which results in the background obscuring the target. To enhance targets and suppress backgrounds, we propose the Basis Decomposition Module (BDM) as an extensible and lightweight module based on basis decomposition, which decomposes a complex feature into several basis features and enhances certain information while eliminating redundancy. Extending BDM leads to a series of modules, including the Spatial Difference Decomposition Module (SD$^mathrm{2}$M), Spatial Difference Decomposition Downsampling Module (SD$^mathrm{3}$M), and Temporal Difference Decomposition Module (TD$^mathrm{2}$M). Based on these modules, we develop the Spatial Difference Decomposition Network (SD$^mathrm{2}$Net) for single-frame ISTD (SISTD) and the Spatiotemporal Difference Decomposition Network (STD$^mathrm{2}$Net) for multi-frame ISTD (MISTD). SD$^mathrm{2}$Net integrates SD$^mathrm{2}$M and SD$^mathrm{3}$M within an adapted U-shaped architecture. We employ TD$^mathrm{2}$M to introduce motion information, which transforms SD$^mathrm{2}$Net into STD$^mathrm{2}$Net. Extensive experiments on SISTD and MISTD datasets demonstrate state-of-the-art (SOTA) performance. On the SISTD task, SD$^mathrm{2}$Net performs well compared to most established networks. On the MISTD datasets, STD$^mathrm{2}$Net achieves a mIoU of 87.68%, outperforming SD$^mathrm{2}$Net, which achieves a mIoU of 64.97%. Our codes are available: https://github.com/greekinRoma/IRSTD_HC_Platform.
Problem

Research questions and friction points this paper is trying to address.

Enhancing infrared small targets lacking texture amidst severe background clutter.
Proposing basis decomposition modules to separate and enhance target features.
Developing networks for single-frame and multi-frame infrared target detection.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Basis Decomposition Module enhances targets and suppresses backgrounds.
Spatial Difference Decomposition Network uses U-shaped architecture for detection.
Temporal Difference Decomposition Module adds motion for multi-frame detection.
🔎 Similar Papers
No similar papers found.
Chen Hu
Chen Hu
School of Artificial Intelligence and Computer Science, Jiangnan University
Geometric Deep LearningMachine Learning
M
Mingyu Zhou
School of Information and Communication Engineering and the Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China
S
Shuai Yuan
School of Instrument Science and Opto-Electronics Engineering, Hefei University of Technology, Hefei, China
H
Hongbo Hu
School of Information and Communication Engineering and the Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China
X
Xiangyu Qiu
School of Information and Communication Engineering and the Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China
Junhai Luo
Junhai Luo
School of Information and Communication Engineering and the Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China
Tian Pu
Tian Pu
Senior Engineer, School of Information and Communication Engineering, UESTC
image processing,infrared small target detection
X
Xiyin Li
School of Intelligent Systems Engineering, the Sun Yat-sen University, Shenzhen, China