CataractCompDetect: Intraoperative Complication Detection in Cataract Surgery

📅 2025-11-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Intraoperative complications—such as iris prolapse, posterior capsule rupture, and vitreous loss—pose major blinding risks during cataract surgery, yet no automated detection methods currently exist. To address this, we propose the first complication detection framework integrating surgical-phase priors with vision-language reasoning: it employs phase-aware localization to guide critical region identification, leverages SAM 2 for high-precision instrument tracking, introduces a complication-specific risk scoring mechanism, and incorporates a vision-language model for event-level semantic reasoning. We further release CataComp, the first publicly available video dataset annotated for intraoperative complications. Evaluated on CataComp, our method achieves a mean F1-score of 70.63%, with 81.8% F1 for iris prolapse detection—marking a substantial improvement in real-time intraoperative risk alerting capability.

Technology Category

Application Category

📝 Abstract
Cataract surgery is one of the most commonly performed surgeries worldwide, yet intraoperative complications such as iris prolapse, posterior capsule rupture (PCR), and vitreous loss remain major causes of adverse outcomes. Automated detection of such events could enable early warning systems and objective training feedback. In this work, we propose CataractCompDetect, a complication detection framework that combines phase-aware localization, SAM 2-based tracking, complication-specific risk scoring, and vision-language reasoning for final classification. To validate CataractCompDetect, we curate CataComp, the first cataract surgery video dataset annotated for intraoperative complications, comprising 53 surgeries, including 23 with clinical complications. On CataComp, CataractCompDetect achieves an average F1 score of 70.63%, with per-complication performance of 81.8% (Iris Prolapse), 60.87% (PCR), and 69.23% (Vitreous Loss). These results highlight the value of combining structured surgical priors with vision-language reasoning for recognizing rare but high-impact intraoperative events. Our dataset and code will be publicly released upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Detecting intraoperative complications in cataract surgery automatically
Identifying iris prolapse, posterior capsule rupture, and vitreous loss
Enabling early warning systems and objective training feedback
Innovation

Methods, ideas, or system contributions that make the work stand out.

Phase-aware localization for surgical event tracking
SAM 2-based tracking for temporal complication detection
Vision-language reasoning for final complication classification
🔎 Similar Papers
No similar papers found.
B
Bhuvan Sachdeva
Sankara Eye Hospital, Bengaluru, India
S
Sneha Kumari
Sankara Eye Hospital, Bengaluru, India
R
Rudransh Agarwal
Sankara Eye Hospital, Bengaluru, India
S
Shalaka Kumaraswamy
Sankara Eye Hospital, Bengaluru, India
N
Niharika Singri Prasad
Sankara Eye Hospital, Bengaluru, India
S
Simon Mueller
University of Bonn, Bonn, Germany
R
Raphael Lechtenboehmer
University of Bonn, Bonn, Germany
Maximilian W. M. Wintergerst
Maximilian W. M. Wintergerst
University of Bonn, Bonn, Germany
Thomas Schultz
Thomas Schultz
Professor of Life Science Informatics and Visualization, University of Bonn
Medical Image AnalysisVisualizationApplied Machine LearningNeuroimagingOphthalmic Imaging
K
Kaushik Murali
Sankara Eye Hospital, Bengaluru, India
M
Mohit Jain
Microsoft Research, Bengaluru, India