🤖 AI Summary
Medical image retrieval faces two key challenges: (1) the definition of "similarity" varies across clinical contexts, and (2) large-scale, high-quality benchmark datasets with fine-grained, anatomy-conditioned relevance annotations are scarce. To address these, we propose a scalable, multi-granularity medical image retrieval framework. Our method introduces the first automatic similarity annotation paradigm grounded in semantic parsing of radiology reports, enabling fine-grained image ranking conditioned on anatomical structures. We construct MIMIC-IR, the first large-scale X-ray retrieval dataset, and CTRATE-IR, the first large-scale CT retrieval dataset, both annotated with anatomy-conditioned relevance rankings. Furthermore, we design an anatomy-aware cross-modal alignment module and a multi-granularity contrastive learning architecture. Our RadIR-CXR and RadIR-ChestCT models achieve state-of-the-art performance on both image-to-image and image-to-report retrieval tasks, outperforming prior methods on 77 of 78 evaluation metrics and demonstrating substantial improvements in anatomy-conditioned retrieval accuracy.
📝 Abstract
Developing advanced medical imaging retrieval systems is challenging due to the varying definitions of "similar images" across different medical contexts. This challenge is compounded by the lack of large-scale, high-quality medical imaging retrieval datasets and benchmarks. In this paper, we propose a novel methodology that leverages dense radiology reports to define image-wise similarity ordering at multiple granularities in a scalable and fully automatic manner. Using this approach, we construct two comprehensive medical imaging retrieval datasets: MIMIC-IR for chest X-rays and CTRATE-IR for CT scans, providing detailed image-image ranking annotations conditioned on diverse anatomical structures. Furthermore, we develop two retrieval systems, RadIR-CXR and RadIR-ChestCT, which demonstrate superior performance on traditional image-image and image-report retrieval tasks. These systems also enable flexible, effective image retrieval conditioned on specific anatomical structures described in text, achieving state-of-the-art results on 77 out of 78 metrics.
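To make the anatomy-conditioned annotation idea concrete, here is a minimal, hypothetical sketch of how relevance rankings could be derived once reports have been parsed into per-anatomy finding sets. The function names, the dictionary schema, and the Jaccard-overlap score are illustrative assumptions, not the paper's actual parsing or scoring pipeline:

```python
# Hypothetical sketch: rank candidate studies against a query study,
# conditioned on one anatomical structure. Assumes each study's report
# has already been parsed into {anatomy: [findings...]} dictionaries;
# the real report-parsing and scoring methods are not specified here.

def anatomy_conditioned_relevance(findings_a, findings_b, anatomy):
    """Jaccard overlap of findings for a single anatomical structure."""
    fa = set(findings_a.get(anatomy, []))
    fb = set(findings_b.get(anatomy, []))
    if not fa and not fb:
        return 1.0  # both studies normal for this anatomy: fully similar
    return len(fa & fb) / len(fa | fb)

def rank_candidates(query, candidates, anatomy):
    """Order candidates by anatomy-conditioned relevance to the query."""
    return sorted(
        candidates,
        key=lambda c: anatomy_conditioned_relevance(
            query["findings"], c["findings"], anatomy
        ),
        reverse=True,
    )

query = {"findings": {"lung": ["opacity", "effusion"], "heart": []}}
c1 = {"id": 1, "findings": {"lung": ["opacity"], "heart": ["cardiomegaly"]}}
c2 = {"id": 2, "findings": {"lung": [], "heart": []}}

# The same pair of studies ranks differently per anatomy: c1 shares a
# lung finding with the query, while c2 matches the query's normal heart.
lung_order = [c["id"] for c in rank_candidates(query, [c2, c1], "lung")]
heart_order = [c["id"] for c in rank_candidates(query, [c2, c1], "heart")]
```

Note how the ordering flips between `lung` and `heart` for the same image pair, which is precisely why a single global similarity score is insufficient and the datasets carry per-anatomy rankings.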