Exploiting Gaussian Agnostic Representation Learning with Diffusion Priors for Enhanced Infrared Small Target Detection

📅 2025-07-24
📈 Citations: 0
Influential: 0
🤖 AI Summary
Infrared small target detection (ISTD) loses robustness in real-world scenarios when high-quality annotated data is scarce. To address this, the paper proposes a framework that combines Gaussian Agnostic Representation Learning with diffusion-based priors. The method has two components: (1) a Gaussian Group Squeezer that performs non-uniform quantization to preserve critical structural details, and (2) a two-stage diffusion model that introduces diffusion priors into synthetic sample reconstruction so that the samples better approximate the underlying distribution of real infrared data. Under data-constrained conditions, the framework improves model generalization and environmental adaptability. Experiments report higher synthetic sample fidelity and detection accuracy than state-of-the-art methods across multiple data-scarce settings, which supports the method's robustness and practical efficacy.

📝 Abstract
Infrared small target detection (ISTD) plays a vital role in numerous practical applications. In pursuit of determining the performance boundaries, researchers rely on large, expensive, manually labeled datasets for representation learning. Nevertheless, this approach renders state-of-the-art ISTD methods highly fragile when facing real-world challenges. In this paper, we first study how detection performance varies across several mainstream methods under various scarcity conditions (namely, the absence of high-quality infrared data) that challenge the prevailing theories about practical ISTD. To address this concern, we introduce Gaussian Agnostic Representation Learning. Specifically, we propose the Gaussian Group Squeezer, which leverages Gaussian sampling and compression for non-uniform quantization. By exploiting a diverse array of training samples, we enhance the resilience of ISTD models against various challenges. Then, we introduce two-stage diffusion models for real-world reconstruction. By aligning quantized signals closely with real-world distributions, we significantly elevate the quality and fidelity of the synthetic samples. Comparative evaluations against state-of-the-art detection methods in various scarcity scenarios demonstrate the efficacy of the proposed approach.
Problem

Research questions and friction points this paper is trying to address.

Enhancing infrared small target detection with limited data
Improving model resilience using Gaussian agnostic representation learning
Elevating synthetic sample quality via diffusion-based reconstruction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Gaussian Agnostic Representation Learning for ISTD
Gaussian Group Squeezer for non-uniform quantization
Two-stage diffusion models for real-world reconstruction
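
The abstract describes the Gaussian Group Squeezer only as "Gaussian sampling and compression for non-uniform quantization". The sketch below is one possible reading of that phrase, not the authors' implementation: quantization levels are drawn from a clipped Gaussian, so they cluster around a chosen intensity, and each pixel is snapped to its nearest level. All function names and parameters (`gaussian_levels`, `quantize_nonuniform`, `synthesize_variants`, `mean`, `std`, `num_levels`) are hypothetical and chosen for illustration.

```python
import numpy as np


def gaussian_levels(num_levels, mean=0.5, std=0.2, rng=None):
    """Draw quantization levels from a Gaussian clipped to [0, 1].

    Levels concentrate near `mean`, which makes the quantizer non-uniform.
    (Assumed design; the paper's exact sampling scheme is not given here.)
    """
    rng = np.random.default_rng() if rng is None else rng
    levels = np.clip(rng.normal(mean, std, size=num_levels), 0.0, 1.0)
    return np.unique(levels)  # sorted, duplicates removed


def quantize_nonuniform(image, levels):
    """Map every pixel of an image in [0, 1] to its nearest level."""
    image = np.asarray(image, dtype=np.float64)
    nearest = np.abs(image[..., None] - levels).argmin(axis=-1)
    return levels[nearest]


def synthesize_variants(image, num_variants=4, num_levels=8, seed=0):
    """Produce several differently quantized copies of one image."""
    rng = np.random.default_rng(seed)
    return [
        quantize_nonuniform(image, gaussian_levels(num_levels, rng=rng))
        for _ in range(num_variants)
    ]
```

Calling `synthesize_variants(img)` on a grayscale array scaled to [0, 1] returns a list of arrays with the same shape as `img`, each using a different randomly drawn level set. In the framework described above, quantized signals of this kind would then be passed to the diffusion stage for reconstruction; that stage is not sketched here.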
Junyao Li
School of Information Engineering, Guangdong University of Technology, Guangzhou, 510006, China
Yahao Lu
Guangdong University of Technology
Infrared small target detection, 3D target detection, Transformer, Diffusion
Xingyuan Guo
School of Information Engineering, Guangdong University of Technology, Guangzhou, 510006, China
Xiaoyu Xian
CRRI Institution, Beijing 100000, China
Tiantian Wang
Guangzhou National Laboratory, Guangzhou 510006, China
Yukai Shi
School of Information Engineering, Guangdong University of Technology, Guangzhou, 510006, China