🤖 AI Summary
Traditional query performance prediction (QPP) methods assess query difficulty solely within the context of a single ranker, making them ill-suited for fine-grained selection of the optimal ranker for a given query. This work generalizes the QPP task into three evaluation settings: single-ranker multi-query, multi-ranker single-query, and multi-ranker multi-query, and introduces the first unified QPP evaluation framework tailored for multi-ranker scenarios. Through systematic evaluation of standard QPP methods across diverse ranker–query combinations, the study reveals significant performance disparities among QPP models under different settings and demonstrates that predicting the best-performing ranker is notably more challenging than predicting query difficulty. This research establishes a new paradigm for developing QPP systems with stronger generalization and practical utility.
📄 Abstract
The traditional use case of query performance prediction (QPP) is to identify which queries perform well and which perform poorly for a given ranking model. A more fine-grained and arguably more challenging extension of this task is to determine which ranking models are most effective for a given query. In this work, we generalize the QPP task and its evaluation into three settings: (i) Single-Ranker Multi-Query (SRMQ-PP), corresponding to the standard use case; (ii) Multi-Ranker Single-Query (MRSQ-PP), which evaluates a QPP model's ability to select the most effective ranker for a query; and (iii) Multi-Ranker Multi-Query (MRMQ-PP), which considers predictions jointly across all query–ranker pairs. Our results show that (a) the relative effectiveness of QPP models varies substantially across tasks (SRMQ-PP vs. MRSQ-PP), and (b) predicting the best ranker for a query is considerably more difficult than predicting the relative difficulty of queries for a given ranker.
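The three settings can be pictured as correlating predicted and true effectiveness along different axes of a ranker × query score matrix. Below is a minimal sketch of that framing, assuming Kendall's tau as the correlation measure and per-pair effectiveness scores (e.g., nDCG) as the ground truth; the function names mirror the setting acronyms, and none of this is the paper's actual evaluation code.

```python
# Hedged sketch: one plausible instantiation of the three QPP evaluation
# settings, not the paper's exact protocol. `true` and `pred` are
# rankers-by-queries matrices of true effectiveness and QPP predictions.
# A simple Kendall tau-a is implemented inline to keep the sketch
# dependency-light.
import numpy as np

def kendall_tau(x, y):
    """Kendall's tau-a for tie-free score vectors."""
    n = len(x)
    concordance = sum(
        np.sign(x[i] - x[j]) * np.sign(y[i] - y[j])
        for i in range(n) for j in range(i + 1, n)
    )
    return concordance / (n * (n - 1) / 2)

def srmq_pp(true, pred):
    """Single-Ranker Multi-Query: rank queries within each ranker (row)."""
    return [kendall_tau(t, p) for t, p in zip(true, pred)]

def mrsq_pp(true, pred):
    """Multi-Ranker Single-Query: rank rankers within each query (column)."""
    return [kendall_tau(t, p) for t, p in zip(true.T, pred.T)]

def mrmq_pp(true, pred):
    """Multi-Ranker Multi-Query: one correlation over all pairs jointly."""
    return kendall_tau(true.ravel(), pred.ravel())
```

Note that under this framing MRSQ-PP correlates only one value per ranker for each query, so each per-query correlation rests on very few points, which is consistent with the abstract's finding that selecting the best ranker per query is the harder task.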