Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval

📅 2024-09-14
🏛️ Engineering applications of artificial intelligence
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work systematically evaluates the transferability of pretrained CNNs (e.g., ResNet) versus vision foundation models (ViTs, DINOv2, SAM, CLIP) as feature extractors for cross-modal, few-shot medical image retrieval (CBMIR), while analyzing the impact of input image resolution. We propose a medical-image-specific feature normalization and metric fusion strategy. To our knowledge, this is the first unified benchmark evaluation of foundation models—including SAM and DINOv2—in CBMIR. Experiments show that DINOv2 achieves R@10 = 72.3% on the RSNA CXR dataset without fine-tuning, substantially outperforming conventional CNNs; enables effective cross-domain retrieval; and accelerates inference by 3.2×. t-SNE visualization and linear probing further confirm its robustness and strong transferability.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Evaluating pre-trained CNNs and foundation models for medical image retrieval
Investigating performance on 2D and 3D medical images across different sizes
Comparing CNN and foundation model effectiveness in content-based retrieval tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Pre-trained CNNs and foundation models for feature extraction
Evaluated performance on 2D and 3D medical images
Investigated image size impact on retrieval performance
A
A. Mahbod
Research Center for Medical Image Analysis and Artificial Intelligence, Department of Medicine, Danube Private University, Krems an der Donau, Austria
N
Nematollah Saeidi
Artificial Intelligence Department, Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran
S
Sepideh Hatamikia
Department of Medicine, Danube Private University, Krems an der Donau, Austria; Austrian Center for Medical Innovation and Technology, Wiener Neustadt, Austria
R
Ramona Woitek
Research Center for Medical Image Analysis and Artificial Intelligence, Department of Medicine, Danube Private University, Krems an der Donau, Austria