SRU-Pix2Pix: A Fusion-Driven Generator Network for Medical Image Translation with Few-Shot Learning

📅 2026-01-08
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the limited clinical applicability of magnetic resonance imaging (MRI) due to prolonged acquisition times, high costs, and resolution constraints by proposing an enhanced Pix2Pix framework capable of high-quality medical image translation under extremely limited data regimes (<500 images). The approach innovatively integrates a Squeeze-and-Excitation Residual Network (SEResNet) with U-Net++ within the generator architecture and employs a streamlined PatchGAN discriminator, thereby strengthening channel-wise attention mechanisms and multi-scale feature fusion. Experimental results demonstrate that the proposed model significantly improves structural fidelity, image quality, and generalization across diverse MRI modality translation tasks, confirming its efficacy and novelty in data-scarce scenarios.

Technology Category

Application Category

📝 Abstract
Magnetic Resonance Imaging (MRI) provides detailed tissue information, but its clinical application is limited by long acquisition time, high cost, and restricted resolution. Image translation has recently gained attention as a strategy to address these limitations. Although Pix2Pix has been widely applied in medical image translation, its potential has not been fully explored. In this study, we propose an enhanced Pix2Pix framework that integrates Squeeze-and-Excitation Residual Networks (SEResNet) and U-Net++ to improve image generation quality and structural fidelity. SEResNet strengthens critical feature representation through channel attention, while U-Net++ enhances multi-scale feature fusion. A simplified PatchGAN discriminator further stabilizes training and refines local anatomical realism. Experimental results demonstrate that under few-shot conditions with fewer than 500 images, the proposed method achieves consistent structural fidelity and superior image quality across multiple intra-modality MRI translation tasks, showing strong generalization ability. These results suggest an effective extension of Pix2Pix for medical image translation.
Problem

Research questions and friction points this paper is trying to address.

medical image translation
few-shot learning
structural fidelity
image quality
MRI
Innovation

Methods, ideas, or system contributions that make the work stand out.

SEResNet
U-Net++
Few-shot learning
Medical image translation
Channel attention
🔎 Similar Papers
No similar papers found.
Xihe Qiu
Xihe Qiu
Associate Professor, Shanghai University of Engineering Science
AI for HealthcareVision-Language ModelsReinforcement LearningLarge Language Models
Yang Dai
Yang Dai
Shenzhen Institutes of Advanced Technology,Chinese Academy of Sciences
perovskitesmemristor
X
Xiaoyu Tan
School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai, China
Sijia Li
Sijia Li
Institute of Information Engineering, Chinese Academy of Sciences
F
Fenghao Sun
Department of Thoracic Surgery, Zhongshan Hospital of Fudan University, Shanghai 200032, China
L
Lu Gan
Department of Medical Oncology, Cancer Center, and Fudan Zhangjiang Institute, Zhongshan Hospital of Fudan University, Shanghai 200032, China
L
Liang Liu
Clinical Research Unit, Institute of Clinical Science, Zhongshan Hospital of Fudan University, Shanghai 200032, China