AtlasPatch: An Efficient and Scalable Tool for Whole Slide Image Preprocessing in Computational Pathology

📅 2026-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the inefficiencies in whole-slide image (WSI) preprocessing, where existing tissue detection methods either rely on inaccurate heuristic thresholds or incur substantial computational overhead, thereby hindering AI-driven pathology workflows. To overcome these limitations, we propose AtlasPatch—an efficient and scalable WSI preprocessing framework that, for the first time, leverages a fine-tuned Segment Anything Model to achieve precise tissue segmentation on large-scale, heterogeneous WSI thumbnails. AtlasPatch integrates an efficient mask extrapolation strategy with CPU/GPU parallelization to enable high-throughput patch extraction and streaming processing. Trained on approximately 30,000 semi-manually annotated WSI thumbnails, AtlasPatch achieves state-of-the-art performance in tissue segmentation accuracy, computational efficiency, and downstream multiple-instance learning tasks, significantly reducing computational costs. The code is publicly released.

Technology Category

Application Category

📝 Abstract
Whole-slide image (WSI) preprocessing, typically comprising tissue detection followed by patch extraction, is foundational to AI-driven computational pathology workflows. This remains a major computational bottleneck as existing tools either rely on inaccurate heuristic thresholding for tissue detection, or adopt AI-based approaches trained on limited-diversity data that operate at the patch level, incurring substantial computational complexity. We present AtlasPatch, an efficient and scalable slide preprocessing framework for accurate tissue detection and high-throughput patch extraction with minimal computational overhead. AtlasPatch's tissue detection module is trained on a heterogeneous and semi-manually annotated dataset of ~30,000 WSI thumbnails, using efficient fine-tuning of the Segment-Anything model. The tool extrapolates tissue masks from thumbnails to full-resolution slides to extract patch coordinates at user-specified magnifications, with options to stream patches directly into common image encoders for embedding or store patch images, all efficiently parallelized across CPUs and GPUs. We assess AtlasPatch across segmentation precision, computational complexity, and downstream multiple-instance learning, matching state-of-the-art performance while operating at a fraction of their computational cost. AtlasPatch is open-source and available at https://github.com/AtlasAnalyticsLab/AtlasPatch.
Problem

Research questions and friction points this paper is trying to address.

whole-slide image
tissue detection
patch extraction
computational pathology
computational bottleneck
Innovation

Methods, ideas, or system contributions that make the work stand out.

AtlasPatch
tissue detection
Segment-Anything
whole-slide image preprocessing
scalable patch extraction
🔎 Similar Papers
No similar papers found.
Ahmed Alagha
Ahmed Alagha
Postdoctoral Fellow, FRQNT Scholar
Deep Reinforcement LearningImitation LearningRoboticsCrowdsourcing
C
Christopher Leclerc
Department of Computer Science and Software Engineering (CSSE), Concordia University, Montreal, QC, Canada, H3G 1M8
Y
Yousef Kotp
Department of Computer Science and Software Engineering (CSSE), Concordia University, Montreal, QC, Canada, H3G 1M8; Mila–Quebec AI Institute, Montreal, QC, Canada, H2S 3H1
O
Omar Metwally
Department of Computer Science and Software Engineering (CSSE), Concordia University, Montreal, QC, Canada, H3G 1M8
C
Calvin Moras
Department of Computer Science and Software Engineering (CSSE), Concordia University, Montreal, QC, Canada, H3G 1M8
P
Peter Rentopoulos
Department of Computer Science and Software Engineering (CSSE), Concordia University, Montreal, QC, Canada, H3G 1M8
Ghodsiyeh Rostami
Ghodsiyeh Rostami
PhD Student @ Concordia University
Computer VisionDeep LearningInfrastructure Damage Detection
B
Bich Ngoc Nguyen
University of Montreal Hospital Center (CHUM), Montreal, QC, Canada, H2X 0C1
J
Jumanah Baig
University of Montreal Hospital Center (CHUM), Montreal, QC, Canada, H2X 0C1
A
Abdelhakim Khellaf
University of Montreal Hospital Center (CHUM), Montreal, QC, Canada, H2X 0C1
Vincent Quoc-Huy Trinh
Vincent Quoc-Huy Trinh
University of Montreal
Pathology GI Liver Pancreas
R
R. Mizouni
Department of Computer Science, Khalifa University, Abu Dhabi, UAE
Hadi Otrok
Hadi Otrok
Chair and Professor Computer Science, Khalifa University
Network & Computer SecurityBlockchain & Game TheoryReinforcement Learning
Jamal Bentahar
Jamal Bentahar
Concordia University
Deep Reinforcement LearningFederated LearningMulti-Agent SystemsVerificationServices Computing and IoT
Mahdi S. Hosseini
Mahdi S. Hosseini
Assistant Professor, Concordia University, Mila Quebec AI Institute, McGill University
Computer VisionDeep LearningComputational Pathology