VIBES - Vision Backbone Efficient Selection

📅 2024-10-11
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Selecting pre-trained vision backbones for few-shot image classification is dataset-dependent and lacks generalizability. Method: This paper proposes an efficient, lightweight, task-oriented backbone selection method that replaces costly exhaustive search or generic benchmark recommendations with a fast, GPU-efficient heuristic evaluation—completed within approximately one hour. It integrates task-aware backbone scoring and ranking to shift the selection paradigm from “generic-benchmark-driven” to “task-performance-driven,” drastically reducing computational overhead. Contribution/Results: Experiments on four standard vision benchmarks demonstrate that the selected backbones consistently achieve higher classification accuracy than those recommended by generic benchmarks, validating both effectiveness and practicality.

Technology Category

Application Category

📝 Abstract
This work tackles the challenge of efficiently selecting high-performance pre-trained vision backbones for specific target tasks. Although exhaustive search within a finite set of backbones can solve this problem, it becomes impractical for large datasets and backbone pools. To address this, we introduce Vision Backbone Efficient Selection (VIBES), which aims to quickly find well-suited backbones, potentially trading off optimality for efficiency. We propose several simple yet effective heuristics to address VIBES and evaluate them across four diverse computer vision datasets. Our results show that these approaches can identify backbones that outperform those selected from generic benchmarks, even within a limited search budget of one hour on a single GPU. We reckon VIBES marks a paradigm shift from benchmarks to task-specific optimization.
Problem

Research questions and friction points this paper is trying to address.

Selecting optimal vision backbones for low-data image classification
Overcoming dataset-dependent backbone performance variability
Efficiently searching large pretrained model pools under computational constraints
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dataset-specific backbone selection method
Heuristic search under computational constraints
Efficient selection from 1300+ pretrained models
🔎 Similar Papers
No similar papers found.
J
Joris Guérin
Espace-Dev, IRD, Univ. Montpellier
S
Shray Bansal
College of Computing, Georgia Institute of Technology
Amirreza Shaban
Amirreza Shaban
VP of Machine Learning, Field AI
Machine LearningComputer Vision
P
Paulo Mann
Institute of Mathematics and Statistics, Rio de Janeiro State University
H
H. Gazula
Massachusetts Institute of Technology