Using Common Random Numbers for Simulation-based Planning with Rollouts

📅 2026-05-06
📈 Citations: 0
Influential: 0
📄 PDF

career value

157K/year
📝 Abstract
Simulation-based planning with rollouts is a widely-deployed technique for decision making in stochastic environments. The primary instrument of simulation-based planning is a sampling model, which is repeatedly called to generate trajectories and estimate the utilities of available actions. Among the actions thus explored, one with the maximum estimated utility is then executed. In this paper, we examine the effect of using common random numbers in the simulation process. We obtain a simple recipe for (provably) reducing variance in relative utility when simulations invoke a rollout policy beyond some depth. Experiments on synthetic tasks confirm that our scheme improves task performance. The broader significance of our innovation is apparent from two practical applications: (1) single-step lookahead planning in a pension-disbursement task, and (2) a deployment of the well-known UCT algorithm for the game of Ludo.
Problem

Research questions and friction points this paper is trying to address.

simulation-based planning
rollouts
common random numbers
variance reduction
stochastic decision making
Innovation

Methods, ideas, or system contributions that make the work stand out.

Common Random Numbers
Simulation-based Planning
Rollout Policy
Variance Reduction
UCT Algorithm
🔎 Similar Papers
No similar papers found.