Development and evaluation of CADe systems in low-prevalence setting: The RARE25 challenge for early detection of Barrett's neoplasia

πŸ“… 2026-04-13
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

240K/year
πŸ€– AI Summary
This study addresses the insufficient clinical evaluation of computer-aided detection (CADe) systems for early neoplasia in Barrett’s esophagus under low-prevalence screening settings by launching the RARE25 challenge. It establishes the first standardized benchmark tailored to low-prevalence scenarios, providing large-scale, prevalence-aware public training and hidden test sets, along with operating-point-specific metrics that prioritize high sensitivity while accounting for disease prevalence. Participating methods employed fully supervised classification frameworks, integrating diverse deep learning architectures, pretraining strategies, and model calibration techniques. Results revealed that although several models demonstrated strong discriminative performance, their positive predictive values remained consistently low, highlighting a pronounced sensitivity to prevalence shifts and a tendency to overestimate real-world clinical utility. These findings underscore the critical need for more robust and clinically practical CADe systems.

Technology Category

Application Category

πŸ“ Abstract
Computer-aided detection (CADe) of early neoplasia in Barrett's esophagus is a low-prevalence surveillance problem in which clinically relevant findings are rare. Although many CADe systems report strong performance on balanced or enriched datasets, their behavior under realistic prevalence remains insufficiently characterized. The RARE25 challenge addresses this gap by introducing a large-scale, prevalence-aware benchmark for neoplasia detection. It includes a public training set and a hidden test set reflecting real-world incidence. Methods were evaluated using operating-point-specific metrics emphasizing high sensitivity and accounting for prevalence. Eleven teams from seven countries submitted approaches using diverse architectures, pretraining, ensembling, and calibration strategies. While several methods achieved strong discriminative performance, positive predictive values remained low, highlighting the difficulty of low-prevalence detection and the risk of overestimating clinical utility when prevalence is ignored. All methods relied on fully supervised classification despite the dominance of normal findings, indicating a lack of prevalence-agnostic approaches such as anomaly detection or one-class learning. By releasing a public dataset and a reproducible evaluation framework, RARE25 aims to support the development of CADe systems robust to prevalence shift and suitable for clinical surveillance workflows.
Problem

Research questions and friction points this paper is trying to address.

Barrett's neoplasia
low-prevalence detection
computer-aided detection
clinical surveillance
prevalence shift
Innovation

Methods, ideas, or system contributions that make the work stand out.

low-prevalence detection
computer-aided detection (CADe)
Barrett's neoplasia
prevalence-aware benchmark
RARE25 challenge
T
Tim J. M. Jaspers
Department of Electrical Engineering, Architectures for Reliable Image Analysis Lab, Eindhoven University of Technology, Eindhoven, Netherlands
F
Francisco Caetano
Department of Electrical Engineering, Architectures for Reliable Image Analysis Lab, Eindhoven University of Technology, Eindhoven, Netherlands
C
Cris H. B. Claessens
Department of Electrical Engineering, Architectures for Reliable Image Analysis Lab, Eindhoven University of Technology, Eindhoven, Netherlands
C
Carolus H. J. Kusters
Department of Electrical Engineering, Architectures for Reliable Image Analysis Lab, Eindhoven University of Technology, Eindhoven, Netherlands
R
Rixta A. H. van Eijck van Heslinga
Department of Gastroenterology and Hepatology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
F
Floor Slooter
Department of Gastroenterology and Hepatology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
J
Jacques J. Bergman
Department of Gastroenterology and Hepatology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
P
Peter H. N. De With
Department of Electrical Engineering, Architectures for Reliable Image Analysis Lab, Eindhoven University of Technology, Eindhoven, Netherlands
M
Martijn R. Jong
Department of Gastroenterology and Hepatology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
A
Albert J. de Groof
Department of Gastroenterology and Hepatology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
Fons van der Sommen
Fons van der Sommen
Associate Professor, Eindhoven University of Technology
Image processingComputer VisionMedical Image AnalysisComputer-Aided DiagnosisMachine learning