Towards automatizing detection and quantification of intestinal metaplasia: a multi-expert comparative study

📅 2025-09-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Gastric cancer risk assessment commonly relies on pathologists’ subjective visual estimation of intestinal metaplasia (IM) percentage, resulting in poor reproducibility and inter-observer agreement. To address this, we propose the first deep learning–based framework for automatic IM quantification and systematically evaluate its consistency against multiple expert pathologists in IM grading. Our method employs a multi-scale convolutional architecture and is assessed using F1-score, area under the curve (AUC), and Fleiss’s kappa. The optimal model achieves F1 = 0.80 and AUC = 0.91—significantly surpassing the moderate inter-expert agreement (κ = 0.61–0.75) and revealing substantial variability in manual assessment (low κ between model and individual experts). This study provides the first empirical evidence that AI-based IM quantification attains both high accuracy and superior reproducibility, establishing a robust, standardized technical foundation for gastric cancer risk stratification.

Technology Category

Application Category

📝 Abstract
Current gastric cancer risk systems are prone to errors since they evaluate a visual estimation of intestinal metaplasia percentages to assign a risk. This study presents an automated method to detect and quantify intestinal metaplasia using deep learning models as well as a comparative analysis with visual estimations of three experienced pathologists. Gastric samples were collected from two different cohorts: 149 asymptomatic volunteers from a region with a high prevalence of GCa in Colombia and 56 patients from a third-level hospital. Deep learning models were selected and trained to classify intestinal metaplasia, and predictions were used to estimate the percentage of intestinal metaplasia and assign the risk score. Results were compared with independent blinded assessments performed by three experienced pathologists. The best-performing deep learning architecture classified intestinal metaplasia with F1-Score of 0.80 +- 0.01 and AUC of 0.91 +- 0.01. Among pathologists, inter-observer agreement by a Fleiss's Kappa score ranged from 0.61 to 0.75. In comparison, agreement between the pathologists and the best-performing model ranged from 0.37 to 0.54. Deep learning models show potential to detect and quantify the percentage of intestinal metaplasia with greater precision and reproducibility than experienced pathologists. Likewise, estimated risk shows high inter-observer variability when visually assigning the intestinal metaplasia percentage.
Problem

Research questions and friction points this paper is trying to address.

Automating detection and quantification of intestinal metaplasia
Reducing errors in gastric cancer risk assessment systems
Comparing deep learning models with pathologists' visual estimations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep learning models automate intestinal metaplasia detection
Quantifies metaplasia percentage for gastric cancer risk
Compares AI performance against expert pathologists' assessments
🔎 Similar Papers
No similar papers found.
F
Fabian Cano
Computer Imaging and Medical Applications Laboratory - Cim@lab, Universidad Nacional de Colombia, Bogotá D.C., Colombia
M
Mauricio Caviedes
Computer Imaging and Medical Applications Laboratory - Cim@lab, Universidad Nacional de Colombia, Bogotá D.C., Colombia
A
Andres Siabatto
Computer Imaging and Medical Applications Laboratory - Cim@lab, Universidad Nacional de Colombia, Bogotá D.C., Colombia
J
Jesus Villarreal
Computer Imaging and Medical Applications Laboratory - Cim@lab, Universidad Nacional de Colombia, Bogotá D.C., Colombia
J
Jose Quijano
Computer Imaging and Medical Applications Laboratory - Cim@lab, Universidad Nacional de Colombia, Bogotá D.C., Colombia
A
Alvaro Bedoya
Centro de Investigación de Enfermedades digestivas y nutricionales - CIEDYN, Hospital Universitario Departamental de Nariño, Pasto, Colombia
Satish E. Viswanath
Satish E. Viswanath
Associate Professor, Departments of Pediatrics & Biomedical Engineering, Emory University
Artificial IntelligenceRadiomicsMachine LearningComputational ImagingDigital Pathology
Angel Cruz-Roa
Angel Cruz-Roa
School of Engineering, Universidad de los Llanos
Biomedical imagingImage Analysis and UnderstandingComputer VisionMachine Learning
F
Fabio Gonzalez
Machine Learning, Perception and Discovery Lab - Mindlab, Universidad Nacional de Colombia, Bogotá D.C., Colombia
Eduardo Romero
Eduardo Romero
Universidad Nacional de Colombia
Medical Imaging