Learning Multi-scale Spatial-frequency Features for Image Denoising

📅 2025-06-19
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing image denoising methods mostly rely on a fixed single-input single-output U-Net architecture, which limits their capacity to model pixel-level multi-scale representations. They also treat the frequency domain uniformly rather than accounting for the distinct characteristics of high-frequency and low-frequency noise. To address these limitations, we propose the Multi-scale Adaptive Dual-domain Network (MADNet). Its core innovations include: (1) an Adaptive Spatial-Frequency Learning Unit (ASFU), in which a learnable mask separates features into high-frequency and low-frequency components; and (2) an integrated design combining image pyramid inputs with a global feature fusion block in the skip connections. Experiments on synthetic and real-world noisy datasets (the latter including SIDD and DND) are reported to show that MADNet outperforms state-of-the-art methods, particularly in texture detail recovery and high-frequency noise suppression, with average PSNR and SSIM improvements of 0.82 dB and 0.013, respectively.

📝 Abstract
Recent advancements in multi-scale architectures have demonstrated exceptional performance in image denoising tasks. However, existing architectures mainly depend on a fixed single-input single-output U-Net architecture, ignoring multi-scale representations at the pixel level. In addition, previous methods treat the frequency domain uniformly, ignoring the different characteristics of high-frequency and low-frequency noise. In this paper, we propose a novel multi-scale adaptive dual-domain network (MADNet) for image denoising. We use image pyramid inputs to restore noise-free results from low-resolution images. To enable interaction between high-frequency and low-frequency information, we design an adaptive spatial-frequency learning unit (ASFU), in which a learnable mask separates the information into high-frequency and low-frequency components. In the skip connections, we design a global feature fusion block to enhance the features at different scales. Extensive experiments on both synthetic and real noisy image datasets verify the effectiveness of MADNet compared with current state-of-the-art denoising approaches.
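The image pyramid inputs mentioned in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not say how MADNet downsamples its inputs or how many levels it uses, so the 2x2 average pooling, the three levels, and the names `downsample2x` and `build_pyramid` are all assumptions made for illustration.

```python
import numpy as np

def downsample2x(img):
    """Halve height and width by averaging each 2x2 block (assumed scheme)."""
    h, w = img.shape[:2]
    h2, w2 = h - h % 2, w - w % 2          # crop a trailing odd row/column
    img = img[:h2, :w2]
    # Group pixels into 2x2 blocks, then average over the two block axes.
    blocks = img.reshape(h2 // 2, 2, w2 // 2, 2, *img.shape[2:])
    return blocks.mean(axis=(1, 3))

def build_pyramid(img, levels=3):
    """Return [full-res, 1/2-res, 1/4-res, ...] copies of the input image."""
    pyramid = [img.astype(np.float64)]
    for _ in range(levels - 1):
        pyramid.append(downsample2x(pyramid[-1]))
    return pyramid

# Stand-in for a noisy RGB image; a real pipeline would load actual data.
noisy = np.random.default_rng(0).random((64, 64, 3))
pyramid = build_pyramid(noisy, levels=3)
shapes = [p.shape for p in pyramid]
```

Each pyramid level would then feed a different scale of the network, so that coarse levels provide low-resolution context for restoring the full-resolution image.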
Problem

Research questions and friction points this paper is trying to address.

Limitations of the fixed single-input single-output U-Net architecture
Uniform treatment of the frequency domain, ignoring differences between high- and low-frequency noise
Lack of multi-scale representations at the pixel level
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-scale adaptive dual-domain network (MADNet)
Adaptive spatial-frequency learning unit (ASFU)
Global feature fusion block in skip connections
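The idea behind the ASFU, a learnable mask that separates features into high- and low-frequency components, can be sketched in NumPy as follows. This is not the authors' implementation: the sigmoid soft mask, the radial low-pass initialisation, and the names `init_lowpass_logits` and `split_frequencies` are assumptions, and in a trained network the mask logits would be parameters updated by gradient descent rather than fixed values.

```python
import numpy as np

def init_lowpass_logits(h, w, cutoff=0.1, steepness=50.0):
    """Hypothetical initialisation: logits are positive below `cutoff`
    (in cycles per pixel) and negative above it."""
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    radius = np.sqrt(fx ** 2 + fy ** 2)
    return steepness * (cutoff - radius)

def split_frequencies(feat, mask_logits):
    """Split a 2-D feature map into low- and high-frequency parts."""
    mask = 1.0 / (1.0 + np.exp(-mask_logits))   # soft mask in (0, 1)
    spectrum = np.fft.fft2(feat)
    low = np.fft.ifft2(spectrum * mask).real
    high = np.fft.ifft2(spectrum * (1.0 - mask)).real
    return low, high

# Stand-in feature map; a real network would pass learned features.
feat = np.random.default_rng(0).random((16, 16))
logits = init_lowpass_logits(16, 16)
low, high = split_frequencies(feat, logits)
```

Because the two masks sum to one at every frequency, `low + high` reconstructs the input, so the split only redistributes information between the two branches before any further processing.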
Xu Zhao
PCA Lab, Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, China
Chen Zhao
School of Intelligence Science and Technology, Nanjing University, Suzhou, 215163, Jiangsu, China
Xiantao Hu
Nanjing University of Science & Technology
Computer Vision
Hongliang Zhang
PCA Lab, Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, China
Ying Tai
School of Intelligence Science and Technology, Nanjing University, Suzhou, 215163, Jiangsu, China
Jian Yang
PCA Lab, Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education, School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, China