BALAR : A Bayesian Agentic Loop for Active Reasoning

๐Ÿ“… 2026-05-06
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

206K/year
๐Ÿค– AI Summary
This work addresses the limited active reasoning capability of large language models (LLMs) in interactive tasks, particularly their inability to recognize missing information and ask effective clarifying questions. The authors propose a general, fine-tuning-free outer-loop algorithm that introduces Bayesian active inference into LLM-based interaction for the first time. By maintaining a belief over latent task states, selecting clarification questions that maximize expected mutual information, and dynamically expanding the state representation, the method enables task-agnostic, multi-turn structured dialogue. Evaluated on three benchmarksโ€”AR-Bench-DC, AR-Bench-SP, and iCraft-MDโ€”the approach achieves accuracy improvements of 14.6%, 38.5%, and 30.5%, respectively, substantially outperforming existing baselines.
๐Ÿ“ Abstract
Large language models increasingly operate in interactive settings where solving a task requires multiple rounds of information exchange with a user. However, most current systems treat dialogue reactively and lack a principled mechanism to reason about what information is missing and which question should be asked next. We propose BALAR (Bayesian Agentic Loop for Active Reasoning), a task-agnostic outer-loop algorithm that requires no fine-tuning and enables structured multi-turn interaction between an LLM agent and a user. BALAR maintains a structured belief over latent states, selects clarifying questions by maximizing expected mutual information, and dynamically expands its state representation when the current one proves insufficient. We evaluate BALAR on three diverse benchmarks: AR-Bench-DC (detective cases), AR-Bench-SP (thinking puzzles), and iCraft-MD (clinical diagnosis). BALAR significantly outperforms all baselines across all three benchmarks, with $14.6\%$ higher accuracy on AR-Bench-DC, $38.5\%$ on AR-Bench-SP, and $30.5\%$ on iCraft-MD.
Problem

Research questions and friction points this paper is trying to address.

active reasoning
interactive dialogue
information gap
question selection
latent state
Innovation

Methods, ideas, or system contributions that make the work stand out.

Bayesian reasoning
active questioning
multi-turn interaction
latent state belief
expected mutual information