๐ค AI Summary
To address the low segmentation accuracy of irregularly shaped and minute tumors in medical images, this paper proposes a Synergistic Multi-Attention Transformer (SMA-Transformer). The architecture innovatively integrates pixel-wise, channel-wise, and spatial-wise attention mechanisms, coupled with a feature fusion modulator to jointly model local details and global contextโthereby mitigating information loss during attention transformation and feature recalibration. Its modular design incorporates residual connections to enhance gradient flow and feature reuse. Evaluated on multi-organ, liver tumor, and bladder tumor segmentation tasks, the method achieves state-of-the-art (SOTA) performance, particularly for small-target segmentation. It delivers significant improvements in key metrics: Dice coefficient and 95th-percentile Hausdorff Distance (HD95), demonstrating superior accuracy and boundary localization.
๐ Abstract
In medical image segmentation, specialized computer vision techniques, notably transformers grounded in attention mechanisms and residual networks employing skip connections, have been instrumental in advancing performance. Nonetheless, previous models often falter when segmenting small, irregularly shaped tumors. To this end, we introduce SMAFormer, an efficient, Transformer-based architecture that fuses multiple attention mechanisms for enhanced segmentation of small tumors and organs. SMAFormer can capture both local and global features for medical image segmentation. The architecture comprises two pivotal components. First, a Synergistic Multi-Attention (SMA) Transformer block is proposed, which has the benefits of Pixel Attention, Channel Attention, and Spatial Attention for feature enrichment. Second, addressing the challenge of information loss incurred during attention mechanism transitions and feature fusion, we design a Feature Fusion Modulator. This module bolsters the integration between the channel and spatial attention by mitigating reshaping-induced information attrition. To evaluate our method, we conduct extensive experiments on various medical image segmentation tasks, including multi-organ, liver tumor, and bladder tumor segmentation, achieving state-of-the-art results. Code and models are available at: https://github.com/lzeeorno/SMAFormer.