🤖 AI Summary
To address the challenge of efficiently selecting the optimal large language model (LLM) for a specific task under limited annotation resources, this paper proposes LLM SELECTOR—the first active learning framework tailored for LLM selection. It employs an adaptive query selection strategy to identify the most discriminative input instances and introduces a lightweight, judge-style oracle model to replace human annotators, substantially reducing annotation costs. Its core contribution lies in the systematic integration of active learning into LLM evaluation and selection—departing from conventional static benchmarking approaches that rely on exhaustive human annotations. Extensive experiments across 6 benchmarks and 151 LLMs show that LLM SELECTOR identifies the best or a near-best model while cutting annotation requirements by up to 59.62%, trading little selection accuracy for large gains in annotation efficiency.
📝 Abstract
We introduce LLM SELECTOR, the first framework for active model selection of Large Language Models (LLMs). Unlike prior evaluation and benchmarking approaches that rely on fully annotated datasets, LLM SELECTOR efficiently identifies the best LLM with limited annotations. In particular, for any given task, LLM SELECTOR adaptively selects a small set of queries to annotate that are most informative about the best model for the task. To further reduce annotation cost, we leverage a judge-based oracle annotation model. Through extensive experiments on 6 benchmarks with 151 LLMs, we show that LLM SELECTOR reduces annotation costs by up to 59.62% when selecting the best and near-best LLM for the task.
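The adaptive loop the abstract describes — annotate the queries most informative about which model is best, then rank models on the annotated subset — can be sketched as follows. This is an illustrative disagreement-based acquisition strategy on toy data, not the paper's actual algorithm; the function names, the tie-breaking rule, and the simulated oracle (standing in for the judge-based annotator) are all assumptions for the sketch.

```python
import random

def active_model_selection(model_preds, oracle, budget, seed=0):
    """Illustrative active model selection loop (not LLM SELECTOR itself).

    model_preds: dict mapping model name -> list of predictions, one per query.
    oracle: callable(query_index) -> gold answer (stand-in for a judge annotator).
    budget: maximum number of queries to annotate.
    Returns the model with the most correct answers on annotated queries.
    """
    rng = random.Random(seed)
    models = list(model_preds)
    n_queries = len(next(iter(model_preds.values())))
    correct = {m: 0 for m in models}
    labeled = set()

    def disagreement(i):
        # Queries where candidate models disagree discriminate between them;
        # queries where all models agree reveal nothing about the ranking.
        return len({model_preds[m][i] for m in models})

    for _ in range(budget):
        candidates = [i for i in range(n_queries) if i not in labeled]
        if not candidates:
            break
        # Pick the most contested unlabeled query; break ties randomly.
        i = max(candidates, key=lambda q: (disagreement(q), rng.random()))
        gold = oracle(i)  # one annotation spent
        labeled.add(i)
        for m in models:
            if model_preds[m][i] == gold:
                correct[m] += 1

    return max(models, key=lambda m: correct[m])

# Toy usage: model "b" answers every query correctly, so any annotated
# subset suffices to identify it without labeling all five queries.
gold = ["x", "y", "z", "x", "y"]
preds = {
    "a": ["x", "y", "w", "w", "w"],
    "b": list(gold),
    "c": ["w", "w", "z", "w", "w"],
}
best = active_model_selection(preds, oracle=lambda i: gold[i], budget=3)
print(best)  # "b"
```

The acquisition rule here (maximize answer disagreement) is one simple proxy for "most informative about the best model"; the paper's adaptive strategy and judge-based oracle are more involved, but the budgeted annotate-then-rank structure is the same.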