MMNavAgent: Multi-Magnification WSI Navigation Agent for Clinically Consistent Whole-Slide Analysis

📅 2026-03-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the limitation of existing AI methods in whole-slide image (WSI) diagnosis, which typically rely on a single or predefined magnification level and thus fail to emulate the multi-scale, dynamic interaction inherent in pathologists’ clinical workflow. To bridge this gap, the authors propose a clinically aligned multi-magnification navigation agent that innovatively integrates a Cross-Magnification Navigation Tool (CMT) and a Magnification Selection Tool (MST). By leveraging context-aware multi-scale representation fusion and memory-driven adaptive decision-making, the agent performs sequential diagnostic analysis akin to human pathologists. Evaluated on public datasets, the proposed method significantly outperforms non-agent baselines, achieving a 1.45% improvement in AUC and a 2.93% gain in balanced accuracy (BACC).

Technology Category

Application Category

📝 Abstract
Recent AI navigation approaches aim to improve Whole-Slide Image (WSI) diagnosis by modeling spatial exploration and selecting diagnostically relevant regions, yet most operate at a single fixed magnification or rely on predefined magnification traversal. In clinical practice, pathologists examine slides across multiple magnifications and selectively inspect only necessary scales, dynamically integrating global and cellular evidence in a sequential manner. This mismatch prevents existing methods from modeling cross-magnification interactions and adaptive magnification selection inherent to real diagnostic workflows. To these, we propose a clinically consistent Multi-Magnification WSI Navigation Agent (MMNavAgent) that explicitly models multi magnification interaction and adaptive magnification selection. Specifically, we introduce a Cross-Magnification navigation Tool (CMT) that aggregates contextual information from adjacent magnifications to enhance discriminative representations along the navigation path. We further introduce a Magnification Selection Tool (MST) that leverages memory-driven reasoning within the agent framework to enable interactive and adaptive magnification selection, mimicking the sequential decision process of pathologists. Extensive experiments on a public dataset demonstrate improved diagnostic performance, with 1.45% gain of AUC and 2.93% gain of BACC over a non-agent baseline. Code will be public upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Whole-Slide Image
multi-magnification
adaptive magnification selection
clinical diagnosis
cross-magnification interaction
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-magnification navigation
adaptive magnification selection
cross-magnification interaction
memory-driven reasoning
whole-slide image analysis
🔎 Similar Papers
No similar papers found.
Z
Zhengyang Xu
Institute of Pathology, Technical University of Munich, Germany
Han Li
Han Li
Computer Aided Medical Procedures (CAMP), Technische Universitaet Muenchen (TUM).
medical AI
J
Jingsong Liu
Institute of Pathology, Technical University of Munich, Germany
L
Linrui Xie
Northwest University of China
X
Xun Ma
Institute of Pathology, Technical University of Munich, Germany
Xin You
Xin You
Beihang University
Performance Tool、HPC
S
Shihui Zu
Dalian University of Technology
A
Ayako Ito
Department of Human Pathology, Juntendo University Graduate School of Medicine
X
Xinyu Hao
Dalian University of Technology
H
Hongming Xu
Dalian University of Technology
Shaohua Kevin Zhou
Shaohua Kevin Zhou
Professor, USTC, FAIMBE, FIAMBE, FIEEE, FMICCAI, FNAI
Medical Image ComputingComputer Vision & Pattern RecognitionMachine & Deep Learning
Nassir Navab
Nassir Navab
Professor of Computer Science, Technische Universität München
P
Peter J. Schüffler
Institute of Pathology, Technical University of Munich, Germany