XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational Encoder-Decoder

📅 2024-12-09
📈 Citations: 0
✨ Influential: 0
🤖 AI Summary
To address imprecise tumor boundary delineation in glioma MRI when some modalities are missing, this paper proposes a multi-task framework that jointly optimizes tumor segmentation and cross-modal reconstruction. It introduces the Self-Attention Variational Encoder (SAVE) to improve the fusion of heterogeneous modality features, and the Squeeze-Fusion-Excitation Cross Awareness (SFECA) module to let the segmentation and reconstruction tasks exchange features and optimize cooperatively. The overall architecture integrates Vision XLSTM with a Heteromodal Variational Encoder-Decoder (HVED) to fuse spatial and temporal features. Evaluated on the BraTS 2024 dataset under single- and dual-modality missing scenarios, the method is reported to achieve state-of-the-art performance: a segmentation Dice score improvement of over 3.2% and a 4.7 dB gain in reconstruction PSNR, indicating greater robustness and generalizability than existing approaches.
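
The two metrics named in the summary can be made concrete with a minimal sketch. It assumes flattened binary masks for Dice and intensities normalized to [0, 1] for PSNR; the function names `dice_score` and `psnr` are illustrative and are not taken from the paper's code.

```python
import math


def dice_score(pred, target):
    """Dice overlap between two flat binary masks (sequences of 0/1)."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Convention assumed here: two empty masks agree perfectly.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


def psnr(reference, reconstruction, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two flat intensity lists."""
    mse = sum((r - x) ** 2 for r, x in zip(reference, reconstruction)) / len(reference)
    # Identical images have zero error, so PSNR is unbounded.
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    print(dice_score([1, 1, 0, 0], [1, 0, 0, 0]))
    print(psnr([0.0, 1.0], [0.0, 0.9]))
```

Dice rewards overlap between predicted and reference tumor masks, while PSNR measures how closely a reconstructed modality matches the acquired one. A gain stated in dB is therefore a difference on a logarithmic error scale.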

πŸ“ Abstract
Neurogliomas are among the most aggressive forms of cancer, presenting considerable challenges in both treatment and monitoring due to their unpredictable biological behavior. Magnetic resonance imaging (MRI) is currently the preferred method for diagnosing and monitoring gliomas. However, the lack of specific imaging techniques often compromises the accuracy of tumor segmentation during the imaging process. To address this issue, we introduce the XLSTM-HVED model. This model integrates a hetero-modal encoder-decoder framework with the Vision XLSTM module to reconstruct missing MRI modalities. By deeply fusing spatial and temporal features, it enhances tumor segmentation performance. The key innovation of our approach is the Self-Attention Variational Encoder (SAVE) module, which improves the integration of modal features. Additionally, it optimizes the interaction of features between segmentation and reconstruction tasks through the Squeeze-Fusion-Excitation Cross Awareness (SFECA) module. Our experiments using the BraTS 2024 dataset demonstrate that our model significantly outperforms existing advanced methods in handling cases where modalities are missing. Our source code is available at https://github.com/Quanato607/XLSTM-HVED.
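
The abstract does not spell out how features from whichever modalities are available get combined. One common choice in hetero-modal variational encoder-decoders is a product of Gaussians over the modalities that are present, sketched below for a single latent dimension. This is an assumption for illustration only, not the authors' SAVE module; `fuse_gaussians` and its parameters are hypothetical names.

```python
import math


def fuse_gaussians(means, logvars, available):
    """Fuse per-modality Gaussian posteriors for one latent dimension.

    Assumption: a standard-normal prior expert (mean 0, variance 1) is
    always included, so the result is defined even if no modality is present.
    Returns (fused_mean, fused_variance).
    """
    precision = 1.0        # precision of the prior expert
    weighted_mean = 0.0    # prior mean (0) times prior precision (1)
    for mu, logvar, present in zip(means, logvars, available):
        if not present:
            continue  # a missing modality contributes nothing
        p = math.exp(-logvar)  # precision = 1 / variance
        precision += p
        weighted_mean += mu * p
    fused_var = 1.0 / precision
    return weighted_mean * fused_var, fused_var


if __name__ == "__main__":
    # Two modalities available versus the second one missing.
    print(fuse_gaussians([2.0, 4.0], [0.0, 0.0], [True, True]))
    print(fuse_gaussians([2.0, 4.0], [0.0, 0.0], [True, False]))
```

Because each present modality only adds precision, the same function handles any subset of inputs. That property is what makes this style of fusion suited to missing-modality settings.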
Problem

Research questions and friction points this paper is trying to address.

Magnetic Resonance Imaging (MRI)
Brain Tumor Segmentation
Image Clarity
Innovation

Methods, ideas, or system contributions that make the work stand out.

XLSTM-HVED
Vision XLSTM
SAVE Module
Shenghao Zhu
University of International Business and Economics
Macroeconomics, Inequality
Yifei Chen
Hangzhou Dianzi University, Hangzhou, China
Shuo Jiang
Hangzhou Dianzi University, Hangzhou, China
Weihong Chen
Hangzhou Dianzi University, Hangzhou, China
Chang Liu
Hangzhou Dianzi University, Hangzhou, China
Yuanhan Wang
Hangzhou Dianzi University, Hangzhou, China
Xu Chen
Hangzhou Dianzi University, Hangzhou, China
Yifan Ke
Hangzhou Dianzi University, Hangzhou, China
Feiwei Qin
Prof. College of Computer Science, Hangzhou Dianzi University
Artificial Intelligence, Computer-Aided Design, Computer Vision, Medical Image Analysis
Changmiao Wang
Children’s Hospital, Zhejiang University School of Medicine, Hangzhou, China
Zhu Zhu
Shenzhen Research Institute of Big Data, Shenzhen, China; Sino-Finland Joint AI Laboratory for Child Health of Zhejiang Province, Hangzhou, China