Managing Diabetic Retinopathy with Deep Learning: A Data Centric Overview

πŸ“… 2026-04-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study addresses the clinical reliability limitations of automated diabetic retinopathy (DR) detection, which stem from geographical biases, insufficient sample sizes, inconsistent annotations, and variable image quality in existing fundus datasets. For the first time, it provides a systematic review and comparative analysis of DR-related datasets from a data-centric perspective, categorizing them by scale, accessibility, and annotation type, and evaluating their suitability for tasks such as binary classification, severity grading, lesion localization, and multi-disease screening. The work identifies the widespread absence of standardized lesion-level annotations and longitudinal data, distills key characteristics of high-quality DR datasets, and offers concrete recommendations for constructing future datasets that enhance clinical reliability and model interpretability, thereby guiding the development of robust AI-assisted screening systems.
πŸ“ Abstract
Diabetic Retinopathy (DR) is a serious microvascular complication of diabetes, and one of the leading causes of vision loss worldwide. Although automated detection and grading, with Deep Learning (DL), can reduce the burden on ophthalmologists, it is constrained by the limited availability of high-quality datasets. Existing repositories often remain geographically narrow, contain limited samples, and exhibit inconsistent annotations or variable image quality; thereby, restricting their clinical reliability. This paper presents a comprehensive review and comparative analysis of fundus image datasets used in the management of DR. The study evaluates their usability across key tasks, including binary classification, severity grading, lesion localization, and multi-disease screening. It also categorizes the datasets by size, accessibility, and annotation type (such as image-level, lesion-level, and multi-disease). Finally, a recently published dataset is presented as a case study to illustrate broader challenges in dataset curation and usage. The review consolidates current knowledge while highlighting persistent gaps such as the lack of standardized lesion-level annotations and longitudinal data. It also outlines recommendations for future dataset development to support clinically reliable and explainable solutions in DR screening.
Problem

Research questions and friction points this paper is trying to address.

Diabetic Retinopathy
Deep Learning
Fundus Image Datasets
Data Quality
Clinical Reliability
Innovation

Methods, ideas, or system contributions that make the work stand out.

data-centric review
diabetic retinopathy
fundus image datasets
lesion-level annotation
dataset curation
πŸ”Ž Similar Papers
No similar papers found.
Shramana Dey
Shramana Dey
Junior Research Fellow, Indian Statistical Institute
Machine LearningDeep LearningMedical Image Analysis
Z
Zahir Khan
Indian Statistical Institute, Kolkata, West Bengal, India
T
T. A. PramodKumar
Dr. Mohan’s Diabetes Specialties Centre, Madras Diabetes Research Foundation, Chennai, Tamil Nadu, India
B. Uma Shankar
B. Uma Shankar
Visiting Professor (Honorary), Indian Statistical Institute, Kolkata, India
Machine Learning(ML-DL)Pattern RecognitionImage ProcessingFuzzy Sets & Rough Sets
A
Ashis K. Dhara
Department of Electrical Engineering, National Institute of Technology, Durgapur, West Bengal, India
R
Ramachandran Rajalakshmi
Dr. Mohan’s Diabetes Specialties Centre, Madras Diabetes Research Foundation, Chennai, Tamil Nadu, India
Rajiv Raman
Rajiv Raman
Senior Consultant, Shri Bhagwan Mahavir Vitreoretinal services
Diabetic Retinopathy
S
Sushmita Mitra
Indian Statistical Institute, Kolkata, West Bengal, India