SWinMamba: Serpentine Window State Space Model for Vascular Segmentation

πŸ“… 2025-07-01
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Medical image vessel segmentation often suffers from structural discontinuities due to the slender, tortuous morphology of vessels and insufficient prior modeling. To address this, we propose SwinMambaβ€”a novel hybrid architecture integrating state-space models with vision Transformers. Specifically, we design a Serpentine-Window Tokenizer (SWToken) for adaptive feature sampling and flexible receptive field control; introduce a Bidirectional Aggregation Module (BAM) to enhance local feature fusion; and construct a Spatial-Frequency Fusion Unit (SFFU) to improve continuity representation. By leveraging serpentine-window sequences, our method enables effective global context modeling, significantly strengthening connectivity modeling for thin, elongated vessels. Evaluated on three mainstream medical vessel datasets, SwinMamba achieves new state-of-the-art performance: 18.3% reduction in Hausdorff distance (improved completeness), 4.2% gain in F1-score (enhanced connectivity), and superior overall segmentation accuracy.

Technology Category

Application Category

πŸ“ Abstract
Vascular segmentation in medical images is crucial for disease diagnosis and surgical navigation. However, the segmented vascular structure is often discontinuous due to its slender nature and inadequate prior modeling. In this paper, we propose a novel Serpentine Window Mamba (SWinMamba) to achieve accurate vascular segmentation. The proposed SWinMamba innovatively models the continuity of slender vascular structures by incorporating serpentine window sequences into bidirectional state space models. The serpentine window sequences enable efficient feature capturing by adaptively guiding global visual context modeling to the vascular structure. Specifically, the Serpentine Window Tokenizer (SWToken) adaptively splits the input image using overlapping serpentine window sequences, enabling flexible receptive fields (RFs) for vascular structure modeling. The Bidirectional Aggregation Module (BAM) integrates coherent local features in the RFs for vascular continuity representation. In addition, dual-domain learning with Spatial-Frequency Fusion Unit (SFFU) is designed to enhance the feature representation of vascular structure. Extensive experiments on three challenging datasets demonstrate that the proposed SWinMamba achieves superior performance with complete and connected vessels.
Problem

Research questions and friction points this paper is trying to address.

Discontinuous vascular segmentation in medical images
Inadequate prior modeling of slender vascular structures
Need for accurate and connected vessel segmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Serpentine window sequences for vascular continuity
Bidirectional state space models integration
Dual-domain learning with spatial-frequency fusion
πŸ”Ž Similar Papers
No similar papers found.
Rongchang Zhao
Rongchang Zhao
Central South University
ophthalmology medical image analysiscomputer visionmachine learning
H
Huanchi Liu
School of Computer Science, Central South University
J
Jian Zhang
School of Computer Science, Central South University