ThermoStereoRT: Thermal Stereo Matching in Real Time via Knowledge Distillation and Attention-based Refinement

๐Ÿ“… 2025-04-10
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenge of real-time disparity estimation in thermal binocular stereo vision under low-texture and low-contrast conditions. Methodologically, we propose an efficient and accurate framework featuring a lightweight backbone for 3D cost volume construction, a novel multi-scale channel-spatial joint attention mechanism, andโ€”cruciallyโ€”the first knowledge distillation strategy specifically designed for sparse thermal ground-truth disparity maps to enhance generalization. We further introduce a channel-spatial collaborative attention refinement module to significantly improve feature discriminability. Evaluated on multiple thermal stereo benchmarks, our method achieves >30 FPS real-time inference while surpassing state-of-the-art accuracy. It demonstrates strong robustness in all-weather applications, including nighttime UAV inspection and confined-space cleaning robots.

Technology Category

Application Category

๐Ÿ“ Abstract
We introduce ThermoStereoRT, a real-time thermal stereo matching method designed for all-weather conditions that recovers disparity from two rectified thermal stereo images, envisioning applications such as night-time drone surveillance or under-bed cleaning robots. Leveraging a lightweight yet powerful backbone, ThermoStereoRT constructs a 3D cost volume from thermal images and employs multi-scale attention mechanisms to produce an initial disparity map. To refine this map, we design a novel channel and spatial attention module. Addressing the challenge of sparse ground truth data in thermal imagery, we utilize knowledge distillation to boost performance without increasing computational demands. Comprehensive evaluations on multiple datasets demonstrate that ThermoStereoRT delivers both real-time capacity and robust accuracy, making it a promising solution for real-world deployment in various challenging environments. Our code will be released on https://github.com/SJTU-ViSYS-team/ThermoStereoRT
Problem

Research questions and friction points this paper is trying to address.

Real-time thermal stereo matching for all-weather conditions
Addressing sparse ground truth data via knowledge distillation
Enhancing disparity accuracy with attention-based refinement modules
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lightweight backbone for real-time thermal stereo matching
Multi-scale attention mechanisms for initial disparity map
Knowledge distillation to enhance performance efficiently
๐Ÿ”Ž Similar Papers
No similar papers found.
Anning Hu
Anning Hu
Professor of Sociology, Fudan University, Shanghai, China
InequalityCultureMethodology
A
Ang Li
Shanghai Key Laboratory of Navigation and Location-based Service, Shanghai Jiao Tong University
X
Xirui Jin
Shanghai Key Laboratory of Navigation and Location-based Service, Shanghai Jiao Tong University
Danping Zou
Danping Zou
Professor, Shanghai Jiao Tong University
Visual SLAMRobotic VisionVision-based navigation