Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning

📅 2026-05-19

🤖 AI Summary
This work addresses the high computational cost of conventional whole-slide image (WSI) analysis, which relies on multiple instance learning (MIL) and must process a large number of high-magnification patches per slide. The authors propose PathCTM, a model that formulates pathological diagnosis as a dynamic, sequential reasoning process. Starting from a low-magnification global view, PathCTM uses an attention mechanism to guide region pruning, switches magnification scales adaptively, and terminates inference early once prediction confidence is sufficient. By combining conditional computation, dynamic scale selection, and early stopping, the method reports a 95.95% reduction in patch usage and an approximately 95.62% reduction in inference time relative to standard MIL-based methods, with no degradation in AUC.
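To put the reported percentages in more familiar terms, the short calculation below converts a percentage reduction into an "N times fewer" factor. The two percentages come from the summary; the 10,000-patch slide is a hypothetical figure chosen only for illustration.

```python
def reduction_to_factor(percent_reduction: float) -> float:
    """Convert a percentage reduction into an 'N times fewer' factor."""
    remaining = 1.0 - percent_reduction / 100.0  # fraction still required
    return 1.0 / remaining


patch_factor = reduction_to_factor(95.95)  # reported patch reduction
time_factor = reduction_to_factor(95.62)   # reported inference-time reduction

# Hypothetical slide size, NOT a number from the paper.
baseline_patches = 10_000
remaining_patches = round(baseline_patches * (1.0 - 95.95 / 100.0))

print(f"{patch_factor:.1f}x fewer patches, {time_factor:.1f}x faster")
# prints "24.7x fewer patches, 22.8x faster"
print(f"{baseline_patches} patches -> about {remaining_patches}")
# prints "10000 patches -> about 405"
```

In other words, the reported figures correspond to roughly a 25-fold cut in patches and a 23-fold cut in inference time.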
📝 Abstract
Traditional whole slide image (WSI) analysis methods typically rely on the multiple instance learning (MIL) paradigm, which extracts patch-level features at high magnification and aggregates them for slide-level prediction. However, such exhaustive patch-level processing is computationally expensive, severely limiting the efficiency and scalability of WSI analysis. To address this challenge, we propose PathCTM (a Pathology-oriented Continuous Thought Model) that enables token-efficient scale-space continuous reasoning for gigapixel WSIs. PathCTM formulates diagnostic inference as a dynamic sequential information pursuit. It progressively transitions from low-magnification global to high-magnification local inspection, and adaptively terminates inference when sufficient evidence is gathered to effectively bound decision uncertainty. Specifically, it uses conditional computation for dynamic scale switching with attention-guided region pruning, coupled with confidence-aware early stopping. Extensive experiments demonstrate that, compared with standard MIL-based methods, PathCTM reduces the number of required image patches by 95.95% and shortens inference time by approximately 95.62%, while maintaining AUC without degradation. Code is available at https://github.com/JSGe-AI/PathCTM.
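The coarse-to-fine procedure described in the abstract can be sketched as a simple loop. This is an illustrative toy under stated assumptions, not the authors' implementation (the linked repository is the reference for that): `Region`, `coarse_to_fine`, `keep_ratio`, `conf_threshold`, the 2x2 zoom rule, and both stub functions are hypothetical names and choices introduced here.

```python
import math
from dataclasses import dataclass


@dataclass
class Region:
    x: int      # grid column at this level
    y: int      # grid row at this level
    level: int  # 0 = lowest magnification


def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def coarse_to_fine(regions, score_fn, classify_fn,
                   max_level=2, keep_ratio=0.25, conf_threshold=0.9):
    """Zoom in only where attention is high; stop once confident."""
    regions = list(regions)
    patches_used = 0
    for level in range(max_level + 1):
        patches_used += len(regions)
        probs = classify_fn(regions)
        # Confidence-aware early stopping.
        if max(probs) >= conf_threshold or level == max_level:
            break
        # Attention-guided pruning: keep the top-scoring fraction.
        weights = softmax([score_fn(r) for r in regions])
        k = max(1, math.ceil(keep_ratio * len(regions)))
        ranked = sorted(zip(weights, regions),
                        key=lambda pair: pair[0], reverse=True)[:k]
        # Scale switch: each kept region becomes 2x2 children (assumed rule).
        regions = [Region(2 * r.x + dx, 2 * r.y + dy, r.level + 1)
                   for _, r in ranked for dx in (0, 1) for dy in (0, 1)]
    return probs, patches_used, level


# Stub scorer: closeness to a made-up lesion centre in level-0 coordinates.
def toy_score(r):
    scale = 2 ** r.level
    return -abs(r.x / scale - 1.5) - abs(r.y / scale - 2.5)


# Stub classifier: confidence simply grows with magnification level.
def toy_classify(regions):
    p = min(0.99, 0.6 + 0.2 * regions[0].level)
    return [p, 1.0 - p]


grid = [Region(x, y, 0) for x in range(4) for y in range(4)]
probs, patches_used, stop_level = coarse_to_fine(grid, toy_score, toy_classify)
print(patches_used, stop_level)
```

In this toy run the loop inspects 16 regions per level across three levels, whereas exhaustively tiling the highest level of the same 4x4 grid would require 256 regions. Lowering `conf_threshold` makes the loop stop at a coarser level and use fewer patches.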
Problem

Research questions and friction points this paper is trying to address.

whole slide image
multiple instance learning
computational efficiency
gigapixel pathology
scale-space reasoning
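
For context on the MIL baseline named in the keywords above, the sketch below shows one common way patch-level features are aggregated into a slide-level feature: an attention-weighted mean. It is a simplified illustration (a single linear scoring vector `w` rather than a learned network), and `attention_mil_pool` is a hypothetical name, not an API from the paper or its repository.

```python
import math


def attention_mil_pool(patch_features, w):
    """Aggregate per-patch feature vectors into one slide-level vector.

    patch_features: list of equal-length feature vectors, one per patch.
    w: scoring vector (assumed linear scorer, for illustration only).
    """
    # One attention score per patch.
    scores = [sum(wi * fi for wi, fi in zip(w, f)) for f in patch_features]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Attention-weighted mean over all patches.
    dim = len(patch_features[0])
    slide = [sum(a * f[d] for a, f in zip(weights, patch_features))
             for d in range(dim)]
    return slide, weights


features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
slide_feature, attn = attention_mil_pool(features, w=[0.0, 0.0])
print(slide_feature, attn)
```

Every patch must be encoded and scored before pooling, so the cost grows linearly with the number of high-magnification patches. That per-patch cost is the friction point the listed keywords refer to.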
Innovation

Methods, ideas, or system contributions that make the work stand out.

adaptive continuous reasoning
conditional computation
attention-guided pruning
confidence-aware early stopping
scale-space inference
Authors

Jiusong Ge
School of Computer Science and Technology, Xi’an Jiao Tong University, Xi’an, China
Yingkang Zhan
School of Computer Science and Technology, Xi’an Jiao Tong University, Xi’an, China
Wenjie Zhao
University of Texas at Dallas
computer vision
Di Zhang
Department of Statistics and Data Science, National University of Singapore
Causal inference, Semi-parametric model, Genetic statistics
Ke Wang
Department of Transmedia Art, Xi’an Academy of Fine Arts, Xi’an, China
Jiashuai Liu
School of Computer Science and Technology, Xi’an Jiao Tong University, Xi’an, China
Chunze Yang
School of Computer Science and Technology, Xi’an Jiao Tong University, Xi’an, China
Chengzu Li
University of Cambridge
Natural Language Processing
Jian Zhang
Xi'an Jiaotong University | Nanyang Technological University
Natural Language Processing, Large Language Models, Event Graph
Yuxin Dong
Ohio State University
machine learning, information theory, learning theory
Ni Zhang
Xi’an Jiaotong University
Saliency Detection
Qidong Liu
Assistant Professor, Xi'an Jiaotong University
Recommender System, Large Language Model, Intelligent Healthcare, Causal Inference, Smart Education
Mireia Crispin-Ortuzar
Department of Oncology, University of Cambridge, Cambridge, U.K.
Huazhu Fu
Principal Scientist, IHPC, A*STAR
Medical Image Analysis, AI for Healthcare, Medical AI, Trustworthy AI
Chen Li
Xi'an Jiaotong University
Zeyu Gao
University of Cambridge
deep learning, machine learning, image processing, medical imaging, hyperspectral imaging