Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and SAM2

📅 2025-01-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Weakly supervised segmentation of tiny, highly reflective foci (HRFs) in optical coherence tomography (OCT) images remains challenging due to severe downsampling and coarse localization in existing methods, leading to missed detections, inaccurate localization, and loss of fine details. To address this, we propose: (1) an LRP-guided prompting mechanism for SAM2 to enhance spatial localization precision; (2) a Compact Convolutional Transformer architecture—replacing conventional multiple-instance learning (MIL) frameworks—that integrates positional encoding and strengthens long-range feature interactions; and (3) an iterative weakly supervised inference framework. Trained solely with point-level annotations, our method significantly improves both segmentation accuracy and recall for HRFs. It achieves high-resolution, fine-grained localization and segmentation while maintaining low annotation cost and strong generalizability across diverse OCT datasets.

Technology Category

Application Category

📝 Abstract
Weakly supervised segmentation has the potential to greatly reduce the annotation effort for training segmentation models for small structures such as hyper-reflective foci (HRF) in optical coherence tomography (OCT). However, most weakly supervised methods either involve a strong downsampling of input images, or only achieve localization at a coarse resolution, both of which are unsatisfactory for small structures. We propose a novel framework that increases the spatial resolution of a traditional attention-based Multiple Instance Learning (MIL) approach by using Layer-wise Relevance Propagation (LRP) to prompt the Segment Anything Model (SAM~2), and increases recall with iterative inference. Moreover, we demonstrate that replacing MIL with a Compact Convolutional Transformer (CCT), which adds a positional encoding, and permits an exchange of information between different regions of the OCT image, leads to a further and substantial increase in segmentation accuracy.
Problem

Research questions and friction points this paper is trying to address.

Weakly Supervised Segmentation
Precision Improvement
Detail Enhancement
Innovation

Methods, ideas, or system contributions that make the work stand out.

Compact Convolution Transformer (CCT)
Local Relevance Propagation (LRP)
Iterative Inference
🔎 Similar Papers
No similar papers found.
Olivier Morelle
Olivier Morelle
B-IT and Department of Computer Science, University of Bonn
J
Justus Bisten
B-IT and Department of Computer Science, University of Bonn
M
Maximilian W M Wintergerst
Department of Ophthalmology, University Hospital Bonn; Augenzentrum Grischun, Chur, Switzerland
R
Robert P. Finger
Department of Ophthalmology, University Hospital Bonn; Department of Ophthalmology, University Medical Center Mannheim, Heidelberg University
Thomas Schultz
Thomas Schultz
Professor of Life Science Informatics and Visualization, University of Bonn
Medical Image AnalysisVisualizationApplied Machine LearningNeuroimagingOphthalmic Imaging