When Independent Sampling Outperforms Agentic Reasoning

πŸ“… 2026-05-08
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

221K/year
πŸ€– AI Summary
This study addresses the optimal allocation of computational resources under a fixed inference budget to maximize solution accuracy on programming competition problems. Through systematic experiments on 216 Codeforces problems spanning multiple difficulty levels, the authors compare agent-based reasoning against independent repeated sampling (k-shot) across varying numbers of model calls and associated costs, incorporating a prompt caching mechanism. The results consistently demonstrate that k-shot outperforms more complex agent-based approaches in both accuracy-cost and accuracy-query trade-offs. The work proposes β€œlog-failure-likelihood per dollar” as a principled optimization criterion for solver design, revealing that in self-contained algorithmic tasks, simple sampling strategies are more cost-effective than elaborate reasoning frameworks. This finding establishes a new paradigm for code generation under resource constraints.
πŸ“ Abstract
We study how to allocate inference-time compute for competitive programming under fixed budgets. Evaluating 216 Codeforces problems across Divisions 1-3, we compare agent-based reasoning with repeated independent sampling (k-shot) as a function of both cost and number of model calls. Across models and difficulty levels, k-shot consistently achieves a better accuracy-cost and accuracy-query tradeoff. This gap persists despite prompt caching in agent frameworks, indicating lower per-call effectiveness. Our results show that, for self-contained algorithmic tasks, independent exploration can outperform deeper agentic reasoning under realistic resource constraints. We also provide a budget-allocation analysis when the inference budget is fixed, and prove that a cost-optimal solver minimizes the principled metric log failure likelihood per dollar.
Problem

Research questions and friction points this paper is trying to address.

inference-time compute allocation
competitive programming
accuracy-cost tradeoff
resource constraints
algorithmic reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

independent sampling
agentic reasoning
inference-time compute allocation
cost-accuracy tradeoff
log failure likelihood per dollar