Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection

πŸ“… 2026-03-17
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of infrared small target detection, which typically relies on costly pixel-level annotations due to the targets’ weak signatures and ambiguous boundaries. To alleviate this burden, we propose a Point-to-Mask framework that transforms sparse point annotations into compact masks via a physics-driven adaptive mask generation (PAMG) module. We further introduce a radius-aware point regression network (RPR-Net) to jointly optimize target center localization and effective radius estimation, integrating spatiotemporal motion cues and a pseudo-labeling closed-loop refinement strategy. To our knowledge, this is the first approach to achieve efficient mask-level detection from arbitrary point supervision. Evaluated on our newly curated SIRSTD-Pixel dataset, the method substantially reduces annotation costs while approaching full-supervision performance, achieving both high accuracy and efficient inference.

Technology Category

Application Category

πŸ“ Abstract
Infrared small target detection (IRSTD) methods predominantly formulate the task as pixel-level segmentation, which requires costly dense annotations and is not well suited to tiny targets with weak texture and ambiguous boundaries. To address this issue, we propose Point-to-Mask, a framework that bridges low-cost point supervision and mask-level detection through two components: a Physics-driven Adaptive Mask Generation (PAMG) module that converts point annotations into compact target masks and geometric cues, and a lightweight Radius-aware Point Regression Network (RPR-Net) that reformulates IRSTD as target center localization and effective radius regression using spatiotemporal motion cues. The two modules form a closed loop: PAMG generates pseudo masks and geometric supervision during training, while the geometric predictions of RPR-Net are fed back to PAMG for pixel-level mask recovery during inference. To facilitate systematic evaluation, we further construct SIRSTD-Pixel, a sequential dataset with refined pixel-level annotations. Experiments show that the proposed framework achieves strong pseudo-label quality, high detection accuracy, and efficient inference, approaching full-supervision performance under point-supervised settings with substantially lower annotation cost. Code and datasets will be available at: https://github.com/GaoScience/point-to-mask.
Problem

Research questions and friction points this paper is trying to address.

Infrared small target detection
point annotation
mask-level detection
weak texture
ambiguous boundaries
Innovation

Methods, ideas, or system contributions that make the work stand out.

Point-to-Mask
Physics-driven Adaptive Mask Generation
Radius-aware Point Regression Network
Infrared Small Target Detection
Point Supervision
W
Weihua Gao
Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China; also with the School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
W
Wenlong Niu
Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China
Jie Tang
Jie Tang
UW Madison
Computed Tomography
M
Man Yang
Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China
J
Jiafeng Zhang
Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China
X
Xiaodong Peng
Key Laboratory of Electronics and Information Technology for Space Systems, National Space Science Center, Chinese Academy of Sciences, Beijing 100190, China