🤖 AI Summary
To address the inherent trade-off in multimodal medical image super-resolution, where CNNs suffer from fixed local receptive fields while Transformers incur prohibitive computational costs for global modeling, this paper proposes a dual-branch Mamba architecture. The global branch leverages State Space Models (SSMs) to model long-range dependencies efficiently, while the local branch employs deformable convolution and a feature modulator to adaptively capture short-range structural details. The paper introduces the first global–local decoupled design of this kind, augmented by a multimodal feature fusion block and a novel Contrastive Edge Loss (CELoss), which together significantly enhance edge-texture recovery and cross-modal complementary modeling. Evaluated on MRI/PET datasets, the method achieves state-of-the-art performance, delivering substantial gains in PSNR and SSIM and markedly better structural fidelity and edge sharpness while maintaining linear inference complexity.
📝 Abstract
Convolutional neural networks and Transformers have made significant progress in multi-modality medical image super-resolution. However, these methods either have a fixed receptive field for local learning or incur a significant computational burden for global learning, limiting super-resolution performance. To solve this problem, State Space Models, notably Mamba, are introduced to efficiently model long-range dependencies in images with linear computational complexity. Building on Mamba and the observation that low-resolution images rely on global information to compensate for missing details, while high-resolution reference images must provide more local details for accurate super-resolution, we propose a global and local Mamba network (GLMamba) for multi-modality medical image super-resolution. Specifically, GLMamba is a two-branch network equipped with a global Mamba branch and a local Mamba branch. The global Mamba branch captures long-range relationships in low-resolution inputs, while the local Mamba branch focuses on short-range details in high-resolution reference images. We also employ a deform block to adaptively extract features in both branches and enhance their representation ability, and a modulator is designed to further refine the deformable features in both the global and local Mamba blocks. To fully exploit the reference image for low-resolution image super-resolution, we further develop a multi-modality feature fusion block that adaptively fuses features by considering similarities, differences, and complementary aspects between modalities. In addition, a contrastive edge loss (CELoss) is developed to further enhance edge textures and contrast in medical images.
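For readers who want a concrete picture of the two-branch layout, here is a minimal, hypothetical PyTorch sketch. Everything in it is an illustrative assumption rather than the authors' implementation: `SimpleSSM` is a toy diagonal state-space scan standing in for the real Mamba selective-scan blocks, and the local branch's window size, the deform block's offset/modulator layout, and the product/difference/concatenation fusion are all guesses at plausible instantiations of what the abstract describes.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.ops import DeformConv2d

class SimpleSSM(nn.Module):
    """Toy diagonal state-space recurrence over a flattened sequence.
    Stand-in for a Mamba block (the real model uses a selective scan)."""
    def __init__(self, dim, state=8):
        super().__init__()
        self.logA = nn.Parameter(-torch.rand(dim, state))  # decay in (0, 1)
        self.B = nn.Parameter(torch.randn(dim, state) * 0.1)
        self.C = nn.Parameter(torch.randn(dim, state) * 0.1)

    def forward(self, x):                       # x: (B, L, D)
        A = torch.exp(self.logA)                # (D, S)
        h = x.new_zeros(x.shape[0], x.shape[2], A.shape[-1])
        ys = []
        for t in range(x.shape[1]):             # sequential linear recurrence
            h = h * A + x[:, t].unsqueeze(-1) * self.B
            ys.append((h * self.C).sum(-1))
        return torch.stack(ys, dim=1)           # (B, L, D)

class DeformBlock(nn.Module):
    """Deformable conv plus a learned sigmoid modulator, loosely mirroring
    the abstract's deform block and modulator."""
    def __init__(self, dim):
        super().__init__()
        self.offset = nn.Conv2d(dim, 2 * 3 * 3, 3, padding=1)  # (dy, dx) per tap
        self.dconv = DeformConv2d(dim, dim, 3, padding=1)
        self.mod = nn.Sequential(nn.Conv2d(dim, dim, 1), nn.Sigmoid())

    def forward(self, x):
        y = self.dconv(x, self.offset(x))
        return y * self.mod(y)

class MambaBranch(nn.Module):
    """window=0: one global scan over the whole image (LR input, long-range).
    window>0: independent scans inside small windows (reference, short-range)."""
    def __init__(self, dim, window=0):
        super().__init__()
        self.deform = DeformBlock(dim)
        self.norm = nn.LayerNorm(dim)
        self.ssm = SimpleSSM(dim)
        self.window = window

    def forward(self, x):                       # x: (B, D, H, W)
        x = self.deform(x)
        B, D, H, W = x.shape
        if self.window:                         # assumes H, W divisible by window
            w = self.window
            s = x.view(B, D, H // w, w, W // w, w)
            s = s.permute(0, 2, 4, 3, 5, 1).reshape(-1, w * w, D)
            s = s + self.ssm(self.norm(s))
            s = s.view(B, H // w, W // w, w, w, D)
            return s.permute(0, 5, 1, 3, 2, 4).reshape(B, D, H, W)
        s = x.flatten(2).transpose(1, 2)        # (B, H*W, D)
        s = s + self.ssm(self.norm(s))
        return s.transpose(1, 2).view(B, D, H, W)

class FusionBlock(nn.Module):
    """Fuses the branches from similarity (product), difference, and
    complementary (raw reference) cues, echoing the abstract's wording."""
    def __init__(self, dim):
        super().__init__()
        self.proj = nn.Conv2d(3 * dim, dim, 1)

    def forward(self, f_lr, f_ref):
        return self.proj(torch.cat([f_lr * f_ref, f_lr - f_ref, f_ref], dim=1))

class GLMambaSketch(nn.Module):
    def __init__(self, dim=32):
        super().__init__()
        self.embed_lr = nn.Conv2d(1, dim, 3, padding=1)
        self.embed_ref = nn.Conv2d(1, dim, 3, padding=1)
        self.global_branch = MambaBranch(dim)            # LR: long-range
        self.local_branch = MambaBranch(dim, window=4)   # ref: short-range
        self.fuse = FusionBlock(dim)
        self.head = nn.Conv2d(dim, 1, 3, padding=1)

    def forward(self, lr, ref):
        # Bicubic pre-upsampling aligns the LR input with the reference grid.
        lr_up = F.interpolate(lr, size=ref.shape[-2:], mode="bicubic",
                              align_corners=False)
        f = self.fuse(self.global_branch(self.embed_lr(lr_up)),
                      self.local_branch(self.embed_ref(ref)))
        return lr_up + self.head(f)              # residual SR prediction

lr = torch.randn(1, 1, 16, 16)    # low-resolution target modality
ref = torch.randn(1, 1, 32, 32)   # high-resolution reference modality
print(GLMambaSketch()(lr, ref).shape)  # torch.Size([1, 1, 32, 32])
```

The point the sketch tries to capture is the asymmetry the abstract describes: the low-resolution input is scanned globally to borrow long-range context, while the high-resolution reference is scanned inside small windows so its contribution stays local.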
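The abstract names the contrastive edge loss but does not define it here, so the following is only one plausible construction: Sobel edge maps combined in a triplet-style ratio that pulls the super-resolved edges toward the ground-truth edges (positive pair) and away from the blurry bicubic upsample (negative pair). The paper's exact formulation may differ.

```python
import torch
import torch.nn.functional as F

def sobel_edges(img):
    """Edge magnitude from fixed Sobel filters (single-channel images)."""
    kx = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]],
                      device=img.device).view(1, 1, 3, 3)
    ky = kx.transpose(-1, -2)                  # Sobel kernel for the y-axis
    gx = F.conv2d(img, kx, padding=1)
    gy = F.conv2d(img, ky, padding=1)
    return torch.sqrt(gx ** 2 + gy ** 2 + 1e-6)

def contrastive_edge_loss(sr, hr, lr_up, eps=1e-6):
    """Hypothetical CELoss: pull SR edges toward the HR target and push
    them away from the blurry bicubic upsample of the LR input."""
    e_sr, e_hr, e_lr = sobel_edges(sr), sobel_edges(hr), sobel_edges(lr_up)
    pos = F.l1_loss(e_sr, e_hr)                # want small: match true edges
    neg = F.l1_loss(e_sr, e_lr)                # want large: escape blurriness
    return pos / (neg + eps)
```

In a full training objective this term would typically be weighted against a pixel reconstruction loss, e.g. `loss = F.l1_loss(sr, hr) + 0.1 * contrastive_edge_loss(sr, hr, lr_up)`, where the weight 0.1 is likewise a placeholder.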