Accurate and lightweight dehazing via multi-receptive-field non-local network and novel contrastive regularization

📅 2023-09-28
🏛️ arXiv.org
📈 Citations: 2
Influential: 0
📄 PDF
🤖 AI Summary
To address weak feature representation and insufficient long-range dependency modeling in single-image dehazing, this paper proposes MRFNLN—a lightweight and efficient model. Methodologically, it introduces three key innovations: (1) a Multi-Scale Receptive Field Attention Block (MSFAB) and a Cross-Branch Non-Local Block (CNLB) that jointly capture multi-scale local structures and global contextual information; (2) Detail-Focused Contrastive Regularization (DFCR), which explicitly enforces fidelity of low-level details; and (3) Spatial Pyramid Downsampling (SPDS) to enhance multi-scale feature aggregation efficiency. With only 1.48M parameters, MRFNLN achieves state-of-the-art PSNR and SSIM scores on benchmark datasets including SOTS and O-HAZE, while maintaining superior inference speed and memory efficiency—demonstrating an effective balance among accuracy, computational cost, and resource consumption.
📝 Abstract
Recently, deep learning-based methods have dominated image dehazing domain. Although very competitive dehazing performance has been achieved with sophisticated models, effective solutions for extracting useful features are still under-explored. In addition, non-local network, which has made a breakthrough in many vision tasks, has not been appropriately applied to image dehazing. Thus, a multi-receptive-field non-local network (MRFNLN) consisting of the multi-stream feature attention block (MSFAB) and cross non-local block (CNLB) is presented in this paper. We start with extracting richer features for dehazing. Specifically, we design a multi-stream feature extraction (MSFE) sub-block, which contains three parallel convolutions with different receptive fields (i.e., $1 imes 1$, $3 imes 3$, $5 imes 5$) for extracting multi-scale features. Following MSFE, we employ an attention sub-block to make the model adaptively focus on important channels/regions. The MSFE and attention sub-blocks constitute our MSFAB. Then, we design a cross non-local block (CNLB), which can capture long-range dependencies beyond the query. Instead of the same input source of query branch, the key and value branches are enhanced by fusing more preceding features. CNLB is computation-friendly by leveraging a spatial pyramid down-sampling (SPDS) strategy to reduce the computation and memory consumption without sacrificing the performance. Last but not least, a novel detail-focused contrastive regularization (DFCR) is presented by emphasizing the low-level details and ignoring the high-level semantic information in the representation space. Comprehensive experimental results demonstrate that the proposed MRFNLN model outperforms recent state-of-the-art dehazing methods with less than 1.5 Million parameters.
Problem

Research questions and friction points this paper is trying to address.

Enhancing image dehazing with multi-receptive-field non-local network
Improving feature extraction via multi-stream attention and cross non-local blocks
Reducing computation with detail-focused contrastive regularization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-receptive-field non-local network for dehazing
Cross non-local block captures long-range dependencies
Detail-focused contrastive regularization enhances low-level details
🔎 Similar Papers
No similar papers found.
Zewei He
Zewei He
Zhejiang University; Huanjiang Laboratory
Image RestorationDehazingSuper-ResolutionInfrared Imaging
Z
Zi-Yuan Chen
School of Aeronautics and Astronautics, Zhejiang University
Ziqian Lu
Ziqian Lu
Zhejiang University;Zhejiang Sci-Tech University
Zero-Shot LearningMulti-modalLLMContrastive Learning
X
Xue-Cheng Sun
School of Aeronautics and Astronautics, Zhejiang University
Z
Zheming Lu
School of Aeronautics and Astronautics, Zhejiang University