A Histologic Dataset of Normal and Atypical Mitotic Figures on Human Breast Cancer (AMi-Br)

📅 2025-01-08
🤖 AI Summary
Problem: Accurate identification of atypical mitotic figures (AMFs) in breast cancer histopathology remains challenging because their morphology is ambiguous and high-quality, expert-annotated datasets are scarce. Method: The authors introduce the first publicly available AMF dataset, labeled by a three-expert majority vote and comprising 3,720 annotated mitotic figures from 223 tumor cases. For benchmarking, they train deep learning models under Monte Carlo cross-validation with several class-imbalance mitigation strategies, and they compare a conventional patch-level data split with a patient-level split that better reflects generalization to unseen patients. Contribution/Results: The baseline experiments reach a mean balanced accuracy of up to 0.806 with the patch-level split and up to 0.713 with the patient-level split. The experiments are presented as a check of the dataset's consistency; the prognostic relevance of AMFs is cited from earlier reports.

📝 Abstract
Assessment of the density of mitotic figures (MFs) in histologic tumor sections is an important prognostic marker for many tumor types, including breast cancer. Recently, it has been reported in multiple works that the quantity of MFs with an atypical morphology (atypical MFs, AMFs) might be an independent prognostic criterion for breast cancer. AMFs are an indicator of mutations in the genes regulating the cell cycle and can lead to aberrant chromosome constitution (aneuploidy) of the tumor cells. To facilitate further research on this topic using pattern recognition, we present the first publicly available dataset of atypical and normal MFs (AMi-Br). For this, we utilized two of the most popular MF datasets (MIDOG 2021 and TUPAC) and subclassified all MFs using a three-expert majority vote. Our final dataset consists of 3,720 MFs, split into 832 AMFs (22.4%) and 2,888 normal MFs (77.6%) across all 223 tumor cases in the combined set. We provide baseline classification experiments to investigate the consistency of the dataset, using Monte Carlo cross-validation and different strategies to combat class imbalance. We found an average balanced accuracy of up to 0.806 when using a patch-level dataset split, and up to 0.713 when using a patient-level split.
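The evaluation terms used in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names, the number of repeats, the test fraction, and the seed are assumptions chosen for illustration. It shows balanced accuracy as the mean of per-class recall, and a Monte Carlo cross-validation in which repeated random train/test splits are drawn by patient, so that no patient contributes cells to both sides of a split.

```python
import random
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so each class counts equally despite imbalance."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        totals[truth] += 1
        hits[truth] += int(truth == pred)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


def patient_level_splits(patient_ids, n_repeats=5, test_fraction=0.2, seed=0):
    """Yield (train_idx, test_idx) for repeated random splits drawn by patient.

    n_repeats, test_fraction and seed are illustrative assumptions,
    not values taken from the paper.
    """
    rng = random.Random(seed)
    unique_patients = sorted(set(patient_ids))
    n_test = max(1, round(len(unique_patients) * test_fraction))
    for _ in range(n_repeats):
        test_patients = set(rng.sample(unique_patients, n_test))
        train_idx = [i for i, p in enumerate(patient_ids) if p not in test_patients]
        test_idx = [i for i, p in enumerate(patient_ids) if p in test_patients]
        yield train_idx, test_idx


# Class counts from the abstract: 832 atypical and 2,888 normal MFs.
y_true = ["atypical"] * 832 + ["normal"] * 2888
majority_guess = ["normal"] * len(y_true)

plain_accuracy = sum(t == p for t, p in zip(y_true, majority_guess)) / len(y_true)
majority_balanced = balanced_accuracy(y_true, majority_guess)
```

With the class counts from the abstract, a classifier that always answers "normal" reaches a plain accuracy of 2,888 / 3,720 ≈ 0.776 but a balanced accuracy of only 0.5, which is why the reported 0.806 and 0.713 are meaningful on this imbalanced set. A patch-level split would instead sample individual cells at random, which lets cells from the same tumor case appear in both training and test data.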
Problem

Research questions and friction points this paper is trying to address.

Breast Cancer
Mitotic Figures Analysis
Image Recognition
Innovation

Methods, ideas, or system contributions that make the work stand out.

AMi-Br Dataset
Atypical Mitotic Figures (AMFs)
Breast Cancer Severity Prediction
Christof A. Bertram
University of Veterinary Medicine, Vienna, Austria
Viktoria Weiss
University of Veterinary Medicine, Vienna, Austria
Taryn A. Donovan
The Schwarzman Animal Medical Center, New York, USA
Sweta Banerjee
Research Assistant - Flensburg University of Applied Sciences
self-supervised learning, domain adaptation, multi-modal approaches in histopathology
Jonas Ammeling
Technische Hochschule Ingolstadt
Computer Vision, Deep Learning, Computational Pathology
Robert Klopfleisch
Freie Universität Berlin, Berlin, Germany
Christopher Kaltenecker
Medical University of Vienna, Vienna, Austria
Marc Aubreville
Professor at Flensburg University of Applied Sciences, Flensburg, Germany
Computer Vision, Deep Learning, Signal Processing