Generalized Proof-Number Monte-Carlo Tree Search

📅 2025-06-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Integrating Proof-Number Search (PNS) and Monte-Carlo Tree Search (MCTS) remains challenging in multi-player (≥3 players) games due to incompatible search semantics and the lack of a principled framework for handling non-zero-sum, continuous-scoring, and adversarial interactions. Method: We propose Generalized Proof-Number Search with Monte-Carlo Tree Search (GPNS-MCTS), the first approach to maintain player-specific proof numbers, extending PNS’s binary proof/disproof capability to score-bounded reasoning. GPNS-MCTS combines proof-number-guided UCB selection with Score-Bounded MCTS to jointly establish win/loss conclusions and tight score bounds. Contribution/Results: Evaluated on 11 multi-player board games, GPNS-MCTS achieves an 80% win-rate improvement over conventional PNS-MCTS hybrids in 8 games. It significantly enhances generality and performance in continuous-scoring, non-zero-sum, and multi-adversary settings, establishing a new state-of-the-art for rigorous, bounded-rational decision-making in complex multi-player games.

Technology Category

Application Category

📝 Abstract
This paper presents Generalized Proof-Number Monte-Carlo Tree Search: a generalization of recently proposed combinations of Proof-Number Search (PNS) with Monte-Carlo Tree Search (MCTS), which use (dis)proof numbers to bias UCB1-based Selection strategies towards parts of the search that are expected to be easily (dis)proven. We propose three core modifications of prior combinations of PNS with MCTS. First, we track proof numbers per player. This reduces code complexity in the sense that we no longer need disproof numbers, and generalizes the technique to be applicable to games with more than two players. Second, we propose and extensively evaluate different methods of using proof numbers to bias the selection strategy, achieving strong performance with strategies that are simpler to implement and compute. Third, we merge our technique with Score Bounded MCTS, enabling the algorithm to prove and leverage upper and lower bounds on scores - as opposed to only proving wins or not-wins. Experiments demonstrate substantial performance increases, reaching the range of 80% for 8 out of the 11 tested board games.
Problem

Research questions and friction points this paper is trying to address.

Generalizes PNS and MCTS for multi-player games
Simplifies proof number usage in selection strategies
Extends algorithm to prove score bounds, not just wins
Innovation

Methods, ideas, or system contributions that make the work stand out.

Track proof numbers per player
Use proof numbers to bias selection
Merge with Score Bounded MCTS
🔎 Similar Papers
No similar papers found.
Jakub Kowalski
Jakub Kowalski
Institute of Computer Science, University of Wrocław, Poland
Artificial IntelligenceGamesComputational Intelligence and GamesProcedural Content GenerationSynchronizing Automata
D
Dennis J. N. J. Soemers
Maastricht University
S
Szymon Kosakowski
University of Wrocław
M
Mark H. M. Winands
Maastricht University