π€ AI Summary
To address the dual challenges of accuracy and fairness in expert user identification within community question-answering platforms, this paper introduces ranxβa high-performance ranking evaluation library. Implemented entirely in NumPy and Cython, ranx features a novel sub-millisecond parallel multi-metric computation architecture, enabling pairwise and multi-group statistical significance testing (e.g., Wilcoxon signed-rank test, t-test) across arbitrary numbers of rankers. Through memory mapping and vectorized logic, it achieves extreme computational efficiency. On TREC and MSMARCO benchmarks, ranx outperforms ir-measures and prior versions of ranx by 10β100Γ, supporting real-time evaluation over million-scale queries. Its core innovation lies in the tight integration of ranking evaluation, statistical inference, and systems-level engineering optimization. As an open-source infrastructure, ranx provides both flexibility and scalability, advancing reproducible and verifiable research in information retrieval and recommender systems.