SemaMIL: Semantic Reordering with Retrieval-Guided State Space Modeling for Whole Slide Image Classification

📅 2025-08-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing multi-instance learning (MIL) methods for whole-slide image (WSI) classification suffer from three key limitations: attention mechanisms neglect local histological context; Transformers incur high computational complexity and are prone to overfitting; and state-space models (SSMs) degrade pathological interpretability due to random token ordering. To address these, we propose SemaMIL—a novel MIL framework. Its core contributions are: (i) a clustering-driven *reversible semantic reordering* that preserves histological spatial coherence while enhancing sequence-level semantic consistency; and (ii) *retrieval-guided state-space modeling*, which enables efficient long-range dependency capture via query-subset retrieval. This design achieves linear-time complexity while supporting high-order feature interactions. Evaluated on four WSI subtyping benchmarks, SemaMIL attains state-of-the-art accuracy, reduces FLOPs and parameter count significantly, and improves both model interpretability and clinical applicability.

Technology Category

Application Category

📝 Abstract
Multiple instance learning (MIL) has become the leading approach for extracting discriminative features from whole slide images (WSIs) in computational pathology. Attention-based MIL methods can identify key patches but tend to overlook contextual relationships. Transformer models are able to model interactions but require quadratic computational cost and are prone to overfitting. State space models (SSMs) offer linear complexity, yet shuffling patch order disrupts histological meaning and reduces interpretability. In this work, we introduce SemaMIL, which integrates Semantic Reordering (SR), an adaptive method that clusters and arranges semantically similar patches in sequence through a reversible permutation, with a Semantic-guided Retrieval State Space Module (SRSM) that chooses a representative subset of queries to adjust state space parameters for improved global modeling. Evaluation on four WSI subtype datasets shows that, compared to strong baselines, SemaMIL achieves state-of-the-art accuracy with fewer FLOPs and parameters.
Problem

Research questions and friction points this paper is trying to address.

Addresses attention-based MIL overlooking contextual relationships in WSIs
Solves transformer models' quadratic computational cost and overfitting issues
Overcomes state space models' loss of histological meaning from shuffling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic reordering clusters similar patches
Retrieval-guided state space modeling reduces complexity
Representative query subset adjusts state parameters
🔎 Similar Papers
No similar papers found.
L
Lubin Gan
USTC, Anhui, China
X
Xiaoman Wu
USTC, Anhui, China
J
Jing Zhang
Anhui Province Key Laboratory of Biomedical Imaging and Intelligent Processing, Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Anhui, China
Zhifeng Wang
Zhifeng Wang
Liaoning University
economics
Linhao Qu
Linhao Qu
Ph.D. School of Basic Medical Sciences, Fudan University
computational pathologymedical image analysismultimodal information fusiondata mining
S
Siying Wu
Anhui Province Key Laboratory of Biomedical Imaging and Intelligent Processing, Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Anhui, China
Xiaoyan Sun
Xiaoyan Sun
Microsoft Research Asia
Image/Video CodingMultimedia ProcessingComputer Vision