Multiscale Structure-Guided Latent Diffusion for Multimodal MRI Translation

📅 2026-03-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of anatomical inconsistency and texture degradation in multimodal MRI synthesis caused by arbitrary missing modalities. To this end, the authors propose the MSG-LDM framework, which introduces a multi-scale style-structure disentanglement mechanism within a latent diffusion model. This approach explicitly separates modality-specific stylistic features from shared anatomical structures and jointly models low-frequency spatial layouts and high-frequency boundary details. By incorporating a structure-aware loss and style consistency constraints, the method significantly enhances structural stability and detail fidelity in synthesized images. Experimental results on the BraTS2020 and WMH datasets demonstrate that the proposed framework outperforms existing methods in both anatomical completeness and overall image quality.

Technology Category

Application Category

📝 Abstract
Although diffusion models have achieved remarkable progress in multi-modal magnetic resonance imaging (MRI) translation tasks, existing methods still tend to suffer from anatomical inconsistencies or degraded texture details when handling arbitrary missing-modality scenarios. To address these issues, we propose a latent diffusion-based multi-modal MRI translation framework, termed MSG-LDM. By leveraging the available modalities, the proposed method infers complete structural information, which preserves reliable boundary details. Specifically, we introduce a style--structure disentanglement mechanism in the latent space, which explicitly separates modality-specific style features from shared structural representations, and jointly models low-frequency anatomical layouts and high-frequency boundary details in a multi-scale feature space. During the structure disentanglement stage, high-frequency structural information is explicitly incorporated to enhance feature representations, guiding the model to focus on fine-grained structural cues while learning modality-invariant low-frequency anatomical representations. Furthermore, to reduce interference from modality-specific styles and improve the stability of structure representations, we design a style consistency loss and a structure-aware loss. Extensive experiments on the BraTS2020 and WMH datasets demonstrate that the proposed method outperforms existing MRI synthesis approaches, particularly in reconstructing complete structures. The source code is publicly available at https://github.com/ziyi-start/MSG-LDM.
Problem

Research questions and friction points this paper is trying to address.

multimodal MRI translation
anatomical inconsistency
texture degradation
missing-modality scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

latent diffusion
structure-style disentanglement
multiscale feature modeling
multimodal MRI translation
anatomical consistency
🔎 Similar Papers
No similar papers found.
J
Jianqiang Lin
Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China
Zhiqiang Shen
Zhiqiang Shen
Northeastern University
Medical Image ComputingSemi-Supervised LearningDomain Generalization
Peng Cao
Peng Cao
Northeastern University
Data miningMachine learninig
J
Jinzhu Yang
Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China; National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, China
O
Osmar R. Zaiane
Amii, University of Alberta, Edmonton, Alberta, Canada
X
Xiaoli Liu
AiShiWeiLai AI Research, Beijing, China