Joint Segmentation and Grading with Iterative Optimization for Multimodal Glaucoma Diagnosis

📅 2026-03-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Early diagnosis of glaucoma is often hindered by insufficient information from a single imaging modality, leading to missed detections. To address this challenge, this work proposes an Iterative Multimodal Optimization (IMO) model that, for the first time, incorporates a denoising diffusion mechanism into multimodal joint glaucoma diagnosis. The model employs an intermediate fusion strategy to integrate fundus and OCT images, introduces a cross-modal feature alignment module to mitigate modality discrepancies, and simultaneously refines optic disc/cup segmentation and glaucoma classification within an iterative refinement decoder. This synergistic design enables mutual enhancement between segmentation and classification tasks, achieving state-of-the-art performance on both. The approach significantly improves multimodal feature integration and offers a reliable, comprehensive assessment framework for clinical use.

Technology Category

Application Category

📝 Abstract
Accurate diagnosis of glaucoma is challenging, as early-stage changes are subtle and often lack clear structural or appearance cues. Most existing approaches rely on a single modality, such as fundus or optical coherence tomography (OCT), capturing only partial pathological information and often missing early disease progression. In this paper, we propose an iterative multimodal optimization model (IMO) for joint segmentation and grading. IMO integrates fundus and OCT features through a mid-level fusion strategy, enhanced by a cross-modal feature alignment (CMFA) module to reduce modality discrepancies. An iterative refinement decoder progressively optimizes the multimodal features through a denoising diffusion mechanism, enabling fine-grained segmentation of the optic disc and cup while supporting accurate glaucoma grading. Extensive experiments show that our method effectively integrates multimodal features, providing a comprehensive and clinically significant approach to glaucoma assessment. Source codes are available at https://github.com/warren-wzw/IMO.git.
Problem

Research questions and friction points this paper is trying to address.

glaucoma diagnosis
multimodal fusion
early-stage detection
optic disc segmentation
disease grading
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal fusion
cross-modal alignment
iterative optimization
denoising diffusion
joint segmentation and grading
🔎 Similar Papers
No similar papers found.
Z
Zhiwei Wang
Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, China
Y
Yuxing Li
Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, China
Meilu Zhu
Meilu Zhu
City University of Hong Kong
Machine LearningDeep LearningComputer VisionImage ProcessingFederated Learning
D
Defeng He
College of Information Engineering, Zhejiang University of Technology, Hangzhou, China
E
Edmund Y. Lam
Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, China