Task-Aware LLM Council with Adaptive Decision Pathways for Decision Support

📅 2026-01-30
🤖 AI Summary
Existing approaches overlook how large language models (LLMs) differ in specialization across tasks, and so struggle to match varied reasoning demands and task complexity. The authors propose Task-Aware LLM Council (TALC), a framework that combines Monte Carlo Tree Search (MCTS) with a structured memory of successful execution trajectories for each model. At each decision point, the framework selects the most suitable expert model by semantically matching the current task context against those stored trajectories. Node values are estimated with an adaptive dual-signal weighting mechanism that fuses real-time model evaluation with historical utility. Experiments on WebShop, HumanEval, and the Game of 24 report higher task success rates and better search efficiency than strong baselines.
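
The routing idea in this summary (pick the model whose past successes best match the current context) can be illustrated with a small sketch. This is not the paper's implementation: the bag-of-words `embed` function stands in for whatever semantic encoder the authors use, and `route`, `success_memory`, and the model names are hypothetical names chosen for the example.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words vector; an assumed stand-in for a real sentence encoder."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def route(context, success_memory):
    """Return (model, score) for the model whose closest stored success
    trajectory is most similar to the current context."""
    query = embed(context)
    best_model, best_score = None, -1.0
    for model, trajectories in success_memory.items():
        # Score a model by its single best-matching past success.
        score = max((cosine(query, embed(t)) for t in trajectories), default=0.0)
        if score > best_score:
            best_model, best_score = model, score
    return best_model, best_score


# Hypothetical success memory: one list of past successful steps per model.
success_memory = {
    "model_a": ["search for red running shoes under 50 dollars",
                "add item to cart and buy"],
    "model_b": ["write a python function that reverses a string",
                "fix the failing unit test"],
}

chosen, score = route("write a python function that sorts a list", success_memory)
print(chosen, round(score, 3))
```

With this toy memory, a coding-style context is routed to the model whose stored successes are coding steps. A real system would presumably replace `embed` with learned embeddings and may aggregate over several matches rather than taking the single best one.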

📝 Abstract
Large language models (LLMs) have shown strong capabilities across diverse decision-making tasks. However, existing approaches often overlook the specialization differences among available models, treating all LLMs as uniformly applicable regardless of task characteristics. This limits their ability to adapt to varying reasoning demands and task complexities. In this work, we propose Task-Aware LLM Council (TALC), a task-adaptive decision framework that integrates a council of LLMs with Monte Carlo Tree Search (MCTS) to enable dynamic expert selection and efficient multi-step planning. Each LLM is equipped with a structured success memory profile derived from prior task trajectories, enabling semantic matching between current reasoning context and past successes. At each decision point, TALC routes control to the most contextually appropriate model and estimates node value using a dual-signal mechanism that fuses model-based evaluations with historical utility scores. These signals are adaptively weighted based on intra-node variance and used to guide MCTS selection, allowing the system to balance exploration depth with planning confidence. Experiments on WebShop, HumanEval, and the Game of 24 demonstrate that TALC achieves superior task success rates and improved search efficiency compared to strong baselines, validating the benefits of specialization-aware routing and adaptive planning.
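
The abstract states that node values fuse model-based evaluations with historical utility scores, weighted adaptively by intra-node variance, and that the result guides MCTS selection. The exact weighting rule is not given here, so the sketch below assumes one plausible form: the weight on the model signal shrinks as the sampled evaluations disagree. The function names, the `sensitivity` parameter, and the use of standard UCT for selection are all assumptions made for illustration.

```python
import math
from statistics import pvariance


def fused_value(model_scores, historical_utility, sensitivity=10.0):
    """Blend the mean model evaluation with a historical utility score.

    Assumption: alpha (weight on the model signal) falls as the variance of
    the sampled evaluations rises. The paper's actual rule may differ.
    """
    mean_eval = sum(model_scores) / len(model_scores)
    variance = pvariance(model_scores)
    alpha = 1.0 / (1.0 + sensitivity * variance)  # lies in (0, 1]
    return alpha * mean_eval + (1.0 - alpha) * historical_utility


def uct_score(value, visits, parent_visits, c=1.4):
    """Standard UCT: exploitation value plus an exploration bonus."""
    if visits == 0:
        return math.inf  # always try unvisited children first
    return value + c * math.sqrt(math.log(parent_visits) / visits)


# Same mean evaluation (0.8), different agreement among samples.
consistent = fused_value([0.8, 0.8, 0.8], historical_utility=0.2)
noisy = fused_value([0.6, 0.8, 1.0], historical_utility=0.2)
print(consistent, noisy)
print(uct_score(consistent, visits=3, parent_visits=10))
```

When the evaluations agree, the fused value stays at the model's estimate; when they disagree, it is pulled toward the historical utility. Under this assumed rule, the search relies more on past experience exactly where the live evaluation is least certain.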
Problem

Research questions and friction points this paper is trying to address.

large language models
task specialization
decision support
adaptive reasoning
model selection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Task-Aware LLM Council
Adaptive Decision Pathways
Monte Carlo Tree Search
Success Memory Profile
Specialization-Aware Routing
Authors
Wei Zhu
School of Information Science and Engineering, Yunnan University, Kunming, China; Yunnan Key Laboratory of Intelligent Systems and Computing, Kunming, China
Lixing Yu
Yunnan University
Distributed Machine Learning
Hao-Ren Yao
Carnegie Mellon University
Health Informatics, Machine Learning, Data Mining
Zhiwen Tang
Yunnan University
Kun Yue
Tsinghua University
HCI, AR, BCI, XR