MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences

📅 2026-01-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge that existing large language models struggle to provide constructive, experience-grounded feedback for tabletop game design, thereby limiting human-AI collaborative creation. The authors propose MeepleLM, the first framework to integrate modeling of player subjective heterogeneity with the Mechanics–Dynamics–Aesthetics (MDA) framework, enabling the inference of gameplay experience and generation of personalized critiques tailored to distinct player types solely from rulebooks. Their approach encompasses a high-quality rulebook–review dataset, structured rulebook refinement, dimension-aware critique sampling, MDA-informed reasoning enhancement, and player persona distillation via fine-tuning. Experiments demonstrate that MeepleLM significantly outperforms GPT-5.1 and Gemini3-Pro in both community alignment and critique quality, with user studies confirming its efficacy as a virtual playtester at a 70% preference rate.

Technology Category

Application Category

📝 Abstract
Recent advancements have expanded the role of Large Language Models in board games from playing agents to creative co-designers. However, a critical gap remains: current systems lack the capacity to offer constructive critique grounded in the emergent user experience. Bridging this gap is fundamental for harmonizing Human-AI collaboration, as it empowers designers to refine their creations via external perspectives while steering models away from biased or unpredictable outcomes. Automating critique for board games presents two challenges: inferring the latent dynamics connecting rules to gameplay without an explicit engine, and modeling the subjective heterogeneity of diverse player groups. To address these, we curate a dataset of 1,727 structurally corrected rulebooks and 150K reviews selected via quality scoring and facet-aware sampling. We augment this data with Mechanics-Dynamics-Aesthetics (MDA) reasoning to explicitly bridge the causal gap between written rules and player experience. We further distill player personas and introduce MeepleLM, a specialized model that internalizes persona-specific reasoning patterns to accurately simulate the subjective feedback of diverse player archetypes. Experiments demonstrate that MeepleLM significantly outperforms latest commercial models (e.g., GPT-5.1, Gemini3-Pro) in community alignment and critique quality, achieving a 70% preference rate in user studies assessing utility. MeepleLM serves as a reliable virtual playtester for general interactive systems, marking a pivotal step towards audience-aligned, experience-aware Human-AI collaboration.
Problem

Research questions and friction points this paper is trying to address.

board game critique
subjective experience
player personas
Human-AI collaboration
emergent user experience
Innovation

Methods, ideas, or system contributions that make the work stand out.

MeepleLM
player personas
MDA framework
subjective feedback simulation
board game critique
Z
Zizhen Li
Shanda AI Research Tokyo
C
Chuanhao Li
Shanghai AI Laboratory
Yibin Wang
Yibin Wang
Intern at UIUC
Trustworthy AI
Y
Yukang Feng
Shanghai Innovation Institute
Jianwen Sun
Jianwen Sun
Software Engineering Application Technology Lab, Huawei, China
Software engineeringDeep reinforcement learning
J
Jiaxin Ai
Shanghai Innovation Institute
F
Fanrui Zhang
Shanghai Innovation Institute
M
Mingzhu Sun
NKU
Yifei Huang
Yifei Huang
The University of Tokyo
egocentric visiongazevideo understandingembodied ai
Kaipeng Zhang
Kaipeng Zhang
Shanghai AI Laboratory
LLMMultimodal LLMsAIGC