Adapting Skill Ratings to Luck-Based Hidden-Information Games

📅 2025-12-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional Elo systems fail to accurately estimate player skill in stochastic, imperfect-information games (e.g., Rummy), as they rely solely on win/loss outcomes and neglect inherent variability in initial game states—particularly hand quality—which confounds skill assessment with luck. Method: We propose an enhanced Elo framework that explicitly models initial hand quality via a computationally efficient hand-evaluation model and a hand-normalized performance metric, thereby decoupling skill from stochasticity. Parameters are calibrated and validated using a large-scale simulation corpus of 270,000 matches across diverse strategic agents. Contribution/Results: Experiments across six strategy-pairing scenarios demonstrate significantly improved rating stability, a 19.3% gain in match outcome prediction accuracy, and markedly superior skill discriminability compared to standard Elo—especially under low-sample-size and high-variance conditions.

Technology Category

Application Category

📝 Abstract
Rating systems play a crucial role in evaluating player skill across competitive environments. The Elo rating system, originally designed for deterministic and information-complete games such as chess, has been widely adopted and modified in various domains. However, the traditional Elo rating system only considers game outcomes for rating calculation and assumes uniform initial states across players. This raises important methodological challenges in skill modelling for popular partially randomized incomplete-information games such as Rummy. In this paper, we examine the limitations of conventional Elo ratings when applied to luck-driven environments and propose a modified Elo framework specifically tailored for Rummy. Our approach incorporates score-based performance metrics and explicitly models the influence of initial hand quality to disentangle skill from luck. Through extensive simulations involving 270,000 games across six strategies of varying sophistication, we demonstrate that our proposed system achieves stable convergence, superior discriminative power, and enhanced predictive accuracy compared to traditional Elo formulations. The framework maintains computational simplicity while effectively capturing the interplay of skill, strategy, and randomness, with broad applicability to other stochastic competitive environments.
Problem

Research questions and friction points this paper is trying to address.

Adapting Elo ratings for luck-based hidden-information games
Incorporating score metrics and initial hand quality to separate skill from luck
Achieving stable convergence and predictive accuracy in stochastic environments
Innovation

Methods, ideas, or system contributions that make the work stand out.

Modified Elo framework for Rummy
Incorporates score metrics and hand quality
Models skill-luck interplay with computational simplicity
🔎 Similar Papers
No similar papers found.
A
Avirup Chakraborty
Indian Statistical Institute Computer Science Engineering, Kolkata, India
S
Shirsa Maitra
Heritage Institute of Technology, India
T
Tathagata Banerjee
Department of Statistics & Data Science, National University of Singapore
D
Diganta Mukherjee
Sampling and Official Statistics Unit, Indian Statistical Institute, Kolkata
Tridib Mukherjee
Tridib Mukherjee
Games24x7
Artificial IntelligenceOutcome-based Interactive PlatformsBehavior Modeling & PersonalizationGame Intelligence & Informat