Orchestrating Intelligence: Confidence-Aware Routing for Efficient Multi-Agent Collaboration across Multi-Scale Models

📅 2026-01-08
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the inefficiency in multi-agent systems for complex reasoning tasks, where uniformly deploying large language models (LLMs) leads to excessive computational costs and overlooks the varying cognitive demands across different reasoning stages. To overcome this limitation, the authors propose the OI-MAS framework, which leverages a heterogeneous pool of multi-scale LLMs and integrates state-dependent dynamic routing with a confidence-aware mechanism. This enables adaptive allocation of agent roles and model scales during reasoning, precisely aligning task requirements with model capabilities. Experimental results demonstrate that OI-MAS not only maintains or improves performance but also achieves up to a 12.88% increase in accuracy while reducing computational costs by as much as 79.78%.

Technology Category

Application Category

📝 Abstract
While multi-agent systems (MAS) have demonstrated superior performance over single-agent approaches in complex reasoning tasks, they often suffer from significant computational inefficiencies. Existing frameworks typically deploy large language models (LLMs) uniformly across all agent roles, failing to account for the varying cognitive demands of different reasoning stages. We address this inefficiency by proposing OI-MAS framework, a novel multi-agent framework that implements an adaptive model-selection policy across a heterogeneous pool of multi-scale LLMs. Specifically, OI-MAS introduces a state-dependent routing mechanism that dynamically selects agent roles and model scales throughout the reasoning process. In addition, we introduce a confidence-aware mechanism that selects appropriate model scales conditioned on task complexity, thus reducing unnecessary reliance on large-scale models. Experimental results show that OI-MAS consistently outperforms baseline multi-agent systems, improving accuracy by up to 12.88\% while reducing cost by up to 79.78\%.
Problem

Research questions and friction points this paper is trying to address.

multi-agent systems
computational inefficiency
large language models
reasoning tasks
model selection
Innovation

Methods, ideas, or system contributions that make the work stand out.

confidence-aware routing
multi-agent systems
multi-scale LLMs
adaptive model selection
computational efficiency
🔎 Similar Papers
No similar papers found.
J
Jingbo Wang
Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China
Sendong Zhao
Sendong Zhao
Harbin Institute of Technology
BioNLPLarge Language Model
J
Jiatong Liu
Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China
H
Hao Wang
Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China
W
Wanting Li
the Institute of Automation of the Chinese Academy of Sciences, China
Bing Qin
Bing Qin
Professor in Harbin Institute of Technology
Natural Language ProcessingInformation ExtractionSentiment Analysis
T
Ting Liu
Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China