MacNet: An End-to-End Manifold-Constrained Adaptive Clustering Network for Interpretable Whole Slide Image Classification

📅 2026-02-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of existing whole-slide image classification methods in interpretability and semantic clarity of learned features, particularly the susceptibility of clustering to high-dimensional noise and semantically ambiguous cluster centers. To overcome these challenges, the authors propose an end-to-end multi-instance learning framework that uniquely integrates Grassmann manifold re-embedding with manifold-adaptive clustering, complemented by a prior knowledge–guided proxy instance annotation mechanism to emphasize pathologically relevant regions. By leveraging the geometric structure of the Grassmann manifold, the approach enhances clustering robustness and achieves significant improvements in both grading accuracy and model interpretability across multi-center datasets, while enabling efficient and principled end-to-end feature learning.

Technology Category

Application Category

📝 Abstract
Whole slide images (WSIs) are the gold standard for pathological diagnosis and sub-typing. Current main-stream two-step frameworks employ offline feature encoders trained without domain-specific knowledge. Among them, attention-based multiple instance learning (MIL) methods are outcome-oriented and offer limited interpretability. Clustering-based approaches can provide explainable decision-making process but suffer from high dimension features and semantically ambiguous centroids. To this end, we propose an end-to-end MIL framework that integrates Grassmann re-embedding and manifold adaptive clustering, where the manifold geometric structure facilitates robust clustering results. Furthermore, we design a prior knowledge guiding proxy instance labeling and aggregation strategy to approximate patch labels and focus on pathologically relevant tumor regions. Experiments on multicentre WSI datasets demonstrate that: 1) our cluster-incorporated model achieves superior performance in both grading accuracy and interpretability; 2) end-to-end learning refines better feature representations and it requires acceptable computation resources.
Problem

Research questions and friction points this paper is trying to address.

whole slide image classification
interpretable AI
clustering ambiguity
high-dimensional features
multiple instance learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

manifold-constrained clustering
Grassmann re-embedding
interpretable MIL
proxy instance labeling
end-to-end WSI classification
🔎 Similar Papers
No similar papers found.
M
Mingrui Ma
School of Computer Science and Technology, Xinjiang University, Urumqi 830046, China, and the Affiliated Tumor Hospital of Xinjiang Medical University, Xinjiang 830011, China
C
Chentao Li
Fu Foundation School of Engineering and Applied Science, Columbia University, New York, NY 10027, USA
P
Pan Huang
School of Nursing, The Hong Kong Polytechnic University, Hong Kong 999077, SAR China
Jing Qin
Jing Qin
University of Southern Denmark
MathematicsStatistics