Dual-Integrated Low-Latency Single-Lens Infrared Computational Imaging for Object Detection

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

215K/year
🤖 AI Summary
This work addresses the high latency in joint image reconstruction and object detection for infrared computational imaging, where existing acceleration methods struggle to balance accuracy and speed due to neglecting optical physical priors. To overcome this, we propose PDI-Net, a novel framework that, during inference, bypasses full reconstruction by directly sharing features from a lightweight half-U-Net encoder to a YOLO detector. We introduce a physics-aware large-to-small bridging module (PALS-Bridge) that adaptively modulates multi-scale convolutions using field-of-view-dependent point spread function priors. Furthermore, a physics-driven optical degradation simulation pipeline is developed for training. Evaluated on the low-SNR M3FD benchmark, PDI-Net reduces inference time by 84.06% and improves mAP@0.5:0.95 by 5.07% compared to a pruned Rec+Det baseline, while also cutting system weight by approximately 50%.
📝 Abstract
Computational imaging enables compact infrared systems, but deep-learning pipelines that combine image reconstruction and object detection often introduce substantial inference latency. Most existing acceleration strategies compress the reconstruction network while overlooking physical priors from the optical path, leaving a trade-off between accuracy and speed. We present Physics-aware Dual-Integrated Network (PDI-Net), a low-latency framework that integrates infrared reconstruction with object detection and further embeds optical priors into the learning process. PDI-Net uses a supervised U-Net during training, while a semi-U-Net encoder shares features directly with a YOLO-based detector during inference, avoiding full image reconstruction. To bridge the gap between fidelity-oriented reconstruction features and detection-oriented semantics, we introduce a physics-aware large-small bridge (PALS-Bridge), which uses field-dependent point spread function priors to adaptively modulate multiscale convolutional branches. A physics-informed optical degradation simulation pipeline is also developed for training and validation. The method is deployed on a single-lens infrared camera, reducing system weight by about 50% compared with traditional multi-lens designs. On the M3FD benchmark under low-SNR conditions, PDI-Net reduces inference time by 84.06% compared with the Rec+Det with pruning strategy while improving mAP@0.5:0.95 by 5.07%. These results demonstrate compact, low-latency computational infrared imaging for real-time object detection on resource-constrained platforms.
Problem

Research questions and friction points this paper is trying to address.

infrared computational imaging
object detection
inference latency
image reconstruction
physical priors
Innovation

Methods, ideas, or system contributions that make the work stand out.

computational imaging
physics-aware learning
low-latency inference
infrared object detection
optical priors
🔎 Similar Papers
No similar papers found.
X
Xuquan Wang
MOE Key Laboratory of Advanced Micro-Structured Materials, the Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, and the Shanghai Frontiers Science Center of Digital Optics, Shanghai 200092, China
G
Guishuo Yang
MOE Key Laboratory of Advanced Micro-Structured Materials, the Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, and the Shanghai Frontiers Science Center of Digital Optics, Shanghai 200092, China
D
Dapeng Yan
MOE Key Laboratory of Advanced Micro-Structured Materials, the Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, and the Shanghai Frontiers Science Center of Digital Optics, Shanghai 200092, China
Y
Yujie Xing
MOE Key Laboratory of Advanced Micro-Structured Materials, the Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, and the Shanghai Frontiers Science Center of Digital Optics, Shanghai 200092, China
X
Xuanyu Qian
MOE Key Laboratory of Advanced Micro-Structured Materials, the Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, and the Shanghai Frontiers Science Center of Digital Optics, Shanghai 200092, China
K
Kai Zhang
School of Computer Science and Artificial Intelligence, Shandong Normal University, Jinan 250358, China, and the Shandong Engineering Research Center for Multimodal Computing and Intelligent Decision Making, Shandong Normal University, Jinan 250358, China
Xiong Dun
Xiong Dun
Tongji University
computational imagingoptical designnano fabricationDeep learning in imaging
J
Jiande Sun
School of Computer Science and Artificial Intelligence, Shandong Normal University, Jinan 250358, China, and the Shandong Engineering Research Center for Multimodal Computing and Intelligent Decision Making, Shandong Normal University, Jinan 250358, China
Z
Zhanshan Wang
MOE Key Laboratory of Advanced Micro-Structured Materials, the Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, and the Shanghai Frontiers Science Center of Digital Optics, Shanghai 200092, China; also with the Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai 200092, China
Xinbin Cheng
Xinbin Cheng
Tongji University
Optical coatingsFunctional surfaces