🤖 AI Summary
To address geometric distortion and detail loss caused by nonlinear lens distortion in PTZ wide-angle cameras, this paper proposes an end-to-end single-image rectification method. Our approach introduces a novel joint learning framework that simultaneously models forward distortion and estimates backward deformation fields. We design a pyramid attention encoder and a multi-scale decoder to enhance fine-grained geometric modeling; further, we incorporate a channel-spatial joint attention mechanism and cross-level feature fusion to improve localization accuracy of distorted regions and texture recovery quality. Evaluated on public benchmarks, AirSim simulations, and real-world PTZ datasets, our method achieves state-of-the-art performance: geometric error is reduced by 23.6%, and PSNR/SSIM scores are significantly improved. Moreover, rectified images demonstrate markedly enhanced robustness in downstream vision tasks such as object detection and tracking.
📝 Abstract
Pan-Tilt-Zoom (PTZ) cameras with wide-angle lenses are widely used in surveillance but often require image rectification due to their inherent nonlinear distortions. Current deep learning approaches typically struggle to maintain fine-grained geometric details, resulting in inaccurate rectification. This paper presents a Forward Distortion and Backward Warping Network (FDBW-Net), a novel framework for wide-angle image rectification. It begins by using a forward distortion model to synthesize barrel-distorted images, reducing pixel redundancy and preventing blur. The network employs a pyramid context encoder with attention mechanisms to generate backward warping flows containing geometric details. Then, a multi-scale decoder is used to restore distorted features and output rectified images. FDBW-Net's performance is validated on diverse datasets: public benchmarks, AirSim-rendered PTZ camera imagery, and real-scene PTZ camera datasets. It demonstrates that FDBW-Net achieves SOTA performance in distortion rectification, boosting the adaptability of PTZ cameras for practical visual applications.