๐ค AI Summary
Existing global-retrievability-based measures of document exposure fairness conflate exposure bias with topic-specific relevance differences, leading to distorted evaluations. To address this, we propose T-Retrievabilityโa topic-aware, local retrievability metric: documents are first clustered by topic; within each cluster, local retrievability is computed; finally, these local scores are aggregated into a global fairness indicator. Unlike conventional global approaches, T-Retrievability enables precise detection of latent exposure disparities across topics induced by neural ranking models. Experiments demonstrate that T-Retrievability significantly enhances the granularity and robustness of fairness analysis, effectively uncovering subtle yet critical exposure differences among models. It thus establishes a more reliable, interpretable, and topic-sensitive benchmark for fairness assessment in information retrieval.
๐ Abstract
Retrievability of a document is a collection-based statistic that measures its expected (reciprocal) rank of being retrieved within a specific rank cut-off. A collection with uniformly distributed retrievability scores across documents is an indicator of fair document exposure. While retrievability scores have been used to quantify the fairness of exposure for a collection, in our work, we use the distribution of retrievability scores to measure the exposure bias of retrieval models. We hypothesise that an uneven distribution of retrievability scores across the entire collection may not accurately reflect exposure bias but rather indicate variations in topical relevance. As a solution, we propose a topic-focused localised retrievability measure, which we call extit{T-Retrievability} (topic-retrievability), which first computes retrievability scores over multiple groups of topically-related documents, and then aggregates these localised values to obtain the collection-level statistics. Our analysis using this proposed T-Retrievability measure uncovers new insights into the exposure characteristics of various neural ranking models. The findings suggest that this localised measure provides a more nuanced understanding of exposure fairness, offering a more reliable approach for assessing document accessibility in IR systems.