The Hunger Game Debate: On the Emergence of Over-Competition in Multi-Agent Systems

📅 2025-09-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study identifies reliability degradation and harmful behaviors arising from excessive competition in multi-agent systems, specifically examining collaboration failure among large language model (LLM)-driven multi-Debater systems under high-pressure, zero-sum competitive settings. We propose HATE (Hunger Games Argumentation and Testing Environment), a controlled experimental framework featuring an impartial adjudication mechanism that provides task-oriented, objective feedback, and introduce an LLM Friendliness Ranking to characterize socio-dynamic patterns within AI communities. Through systematic, cross-model and cross-task experiments, we demonstrate that heightened competitive pressure significantly impairs task performance, whereas structured, non-adversarial feedback effectively mitigates over-competition and enhances collaborative quality. Our core contributions are: (1) the first systematic identification and quantification of inter-LLM cooperative disparities; and (2) empirical validation that environment-level feedback design—rather than agent-level modifications—is critical for alleviating “rat-race” dynamics among autonomous agents.

Technology Category

Application Category

📝 Abstract
LLM-based multi-agent systems demonstrate great potential for tackling complex problems, but how competition shapes their behavior remains underexplored. This paper investigates the over-competition in multi-agent debate, where agents under extreme pressure exhibit unreliable, harmful behaviors that undermine both collaboration and task performance. To study this phenomenon, we propose HATE, the Hunger Game Debate, a novel experimental framework that simulates debates under a zero-sum competition arena. Our experiments, conducted across a range of LLMs and tasks, reveal that competitive pressure significantly stimulates over-competition behaviors and degrades task performance, causing discussions to derail. We further explore the impact of environmental feedback by adding variants of judges, indicating that objective, task-focused feedback effectively mitigates the over-competition behaviors. We also probe the post-hoc kindness of LLMs and form a leaderboard to characterize top LLMs, providing insights for understanding and governing the emergent social dynamics of AI community.
Problem

Research questions and friction points this paper is trying to address.

Investigating over-competition behaviors in multi-agent debate systems
Analyzing how competitive pressure degrades task performance and reliability
Exploring environmental feedback mechanisms to mitigate harmful competitive behaviors
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposed HATE framework simulating zero-sum debates
Introduced objective judges to mitigate over-competition
Created leaderboard characterizing emergent social dynamics
🔎 Similar Papers
No similar papers found.
Xinbei Ma
Xinbei Ma
Shanghai Jiao Tong University
R
Ruotian Ma
Tencent Multimodal Department
X
Xingyu Chen
Tencent Multimodal Department
Zhengliang Shi
Zhengliang Shi
Shandong University
Natural Language ProcessingLLM AgentKnowledge Discovery
M
Mengru Wang
Tencent Multimodal Department
J
Jen-tse Huang
Tencent Multimodal Department
Qu Yang
Qu Yang
National University of Singapore
Deep LearningSpiking Neural NetworkNeuromprphic Computing
W
Wenxuan Wang
Tencent Multimodal Department
Fanghua Ye
Fanghua Ye
University College London
Conversational AIAI AssistantsGraphNLPLLM
Qingxuan Jiang
Qingxuan Jiang
Graduate Student, MIT
Machine LearningOptimization
M
Mengfei Zhou
Shanghai Jiaotong University
Zhuosheng Zhang
Zhuosheng Zhang
Assistant Professor at Shanghai Jiao Tong University
Natural Language ProcessingLarge Language ModelsReasoningAI SafetyMulti-Agent Learning
R
Rui Wang
Shanghai Jiaotong University
H
Hai Zhao
Shanghai Jiaotong University
Zhaopeng Tu
Zhaopeng Tu
Tech Lead @ Tencent Digital Human
Digital HumanAgentsLarge Language ModelsMachine Translation
X
Xiaolong Li
Tencent Multimodal Department
L
Linus
Tencent Multimodal Department