Efficient Medical Image Restoration via Reliability Guided Learning in Frequency Domain

📅 2025-04-15
🤖 AI Summary
To address the low computational efficiency and insufficient reliability of deep learning methods in medical image restoration, this paper proposes LRformer, a lightweight frequency-domain Transformer. It introduces a reliability-guided frequency-domain learning paradigm; designs a Reliable Lesion-Semantic Prior Producer (RLPP) based on Monte Carlo sampling over multiple MedSAM inferences; and proposes a Guided Frequency Cross-Attention (GFCA) mechanism that exploits the conjugate symmetry of the Fast Fourier Transform (FFT) to cut the computational complexity of naive cross-attention nearly in half. Evaluated on low-dose CT denoising, MRI super-resolution, and MRI artifact removal, LRformer is reported to outperform state-of-the-art methods, with higher PSNR and SSIM, 38% fewer parameters, and 47% lower FLOPs, while using Bayesian-inspired uncertainty quantification to make the restored results more reliable for clinical use.

📝 Abstract
Medical image restoration tasks aim to recover high-quality images from degraded observations and are in urgent demand in many clinical scenarios, such as low-dose CT image denoising, MRI super-resolution, and MRI artifact removal. Although existing deep learning-based restoration methods have succeeded by using sophisticated modules, they struggle to produce reconstructions in a computationally efficient way. Moreover, they usually ignore the reliability of the restoration results, which matters even more in medical systems. To alleviate these issues, we present LRformer, a Lightweight Transformer-based method via Reliability-guided learning in the frequency domain. Specifically, inspired by uncertainty quantification in Bayesian neural networks (BNNs), we develop a Reliable Lesion-Semantic Prior Producer (RLPP). RLPP leverages Monte Carlo (MC) estimators with stochastic sampling operations to generate sufficiently reliable priors by performing multiple inferences on the foundational medical image segmentation model MedSAM. Additionally, instead of directly incorporating the priors in the spatial domain, we decompose the cross-attention (CA) mechanism into real symmetric and imaginary anti-symmetric parts via the fast Fourier transform (FFT), resulting in the design of the Guided Frequency Cross-Attention (GFCA) solver. By leveraging the conjugate symmetry property of the FFT, GFCA reduces the computational complexity of naive CA by nearly half. Extensive experimental results on various tasks demonstrate the superiority of the proposed LRformer in both effectiveness and efficiency.
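The "nearly half" figure in the abstract rests on a general property of the Fourier transform of real-valued signals: the coefficients satisfy X[N-k] = conj(X[k]), so only N//2 + 1 of the N coefficients carry independent information. The sketch below illustrates that property alone, using a naive pure-Python DFT. It is not the GFCA solver, and the function names and the sample signal are made up for illustration.

```python
import cmath


def dft(x):
    """Naive O(N^2) discrete Fourier transform of a sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def half_spectrum(x):
    """Keep only the N//2 + 1 non-redundant coefficients of a real signal."""
    return dft(x)[: len(x) // 2 + 1]


def rebuild_full_spectrum(half, n):
    """Recover all N coefficients from the half spectrum via X[N-k] = conj(X[k])."""
    full = list(half)
    for k in range(len(half), n):
        full.append(half[n - k].conjugate())
    return full


# Illustrative real-valued signal (arbitrary values, not data from the paper).
signal = [0.5, 1.0, -0.25, 2.0, 0.0, -1.5, 0.75, 1.25]

full = dft(signal)
half = half_spectrum(signal)
rebuilt = rebuild_full_spectrum(half, len(signal))

# Largest deviation between the directly computed and the rebuilt spectrum.
max_error = max(abs(a - b) for a, b in zip(full, rebuilt))
```

For a length-8 signal, `half` holds 5 coefficients instead of 8, and `rebuilt` matches `full` up to floating-point error. Any computation that only needs the non-redundant half does roughly half the work, which is the kind of saving the abstract attributes to GFCA.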
Problem

Research questions and friction points this paper is trying to address.

Improving efficiency in medical image restoration tasks
Ensuring reliability of restoration results in clinical use
Reducing computational complexity in frequency domain processing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lightweight Transformer-based method in frequency domain
Reliable Lesion-Semantic Prior Producer with MC estimators
Guided Frequency Cross-Attention solver via FFT
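The Monte Carlo idea behind RLPP can be sketched in a few lines: run a stochastic predictor several times on the same input, take the per-pixel mean as the prior, and take the per-pixel variance as an uncertainty estimate. The example below assumes a toy dropout-style predictor as a stand-in for a stochastic inference pass. `stochastic_predict`, `mc_estimate`, and the sample values are hypothetical and are not taken from the paper or from MedSAM.

```python
import random
import statistics


def stochastic_predict(pixel, rng, drop_prob=0.2):
    """Hypothetical stand-in for one stochastic forward pass.

    Drops the input with probability drop_prob and rescales kept values,
    as inverted dropout does, so the expected output equals the input.
    """
    keep = rng.random() >= drop_prob
    return pixel / (1.0 - drop_prob) if keep else 0.0


def mc_estimate(image, num_samples=200, seed=0):
    """Monte Carlo estimate of the per-pixel mean and variance."""
    rng = random.Random(seed)
    means, variances = [], []
    for pixel in image:
        samples = [stochastic_predict(pixel, rng) for _ in range(num_samples)]
        means.append(statistics.fmean(samples))
        variances.append(statistics.pvariance(samples))
    return means, variances


# Illustrative flattened "image" of three pixel intensities (assumed values).
image = [0.0, 0.2, 0.9]
prior, uncertainty = mc_estimate(image)
```

Pixels whose samples disagree more get a larger variance, which a downstream module could use to down-weight unreliable regions of the prior. How LRformer actually consumes these estimates is not specified in the text above.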
Pengcheng Zheng
University of Electronic Science and Technology of China, Chengdu, China
Kecheng Chen
PhD student at EE, City University of Hong Kong
Transfer Learning, AI for Healthcare, Signal Processing
Jiaxin Huang
Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
Bohao Chen
University of Electronic Science and Technology of China, Chengdu, China
Ju Liu
University of Electronic Science and Technology of China, Chengdu, China
Yazhou Ren
University of Electronic Science and Technology of China, Chengdu, China; Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, China
Xiaorong Pu
University of Electronic Science and Technology of China
medical image processing