LLM Chemistry Estimation for Multi-LLM Recommendation

📅 2025-10-04

📈 Citations: 0

✨ Influential: 0

career value

202K/year

🤖 AI Summary

Existing approaches to multi-LLM collaboration lack explicit modeling of inter-model complementarity or conflict, leading to unpredictable ensemble outcomes. Method: This paper introduces LLM Chemistry—a novel framework that formally defines “chemical reactions” among large language models, quantifies cooperative or adversarial interactions via dependency analysis, and establishes a task-aware theoretical model of synergistic effects. Contribution/Results: The framework enables predictive modeling and optimization of multi-model combinations across diverse tasks (e.g., classification, summarization, program repair), model scales, and problem complexities. Experiments demonstrate substantial improvements in ensemble performance; moreover, LLM Chemistry provides interpretable diagnostic tools and principled, dynamic model selection criteria for multi-LLM systems—thereby addressing a critical gap in the quantitative, mechanistic analysis of LLM collaboration.

Technology Category

Application Category

📝 Abstract

Multi-LLM collaboration promises accurate, robust, and context-aware solutions, yet existing approaches rely on implicit selection and output assessment without analyzing whether collaborating models truly complement or conflict. We introduce LLM Chemistry -- a framework that measures when LLM combinations exhibit synergistic or antagonistic behaviors that shape collective performance beyond individual capabilities. We formalize the notion of chemistry among LLMs, propose algorithms that quantify it by analyzing interaction dependencies, and recommend optimal model ensembles accordingly. Our theoretical analysis shows that chemistry among collaborating LLMs is most evident under heterogeneous model profiles, with its outcome impact shaped by task type, group size, and complexity. Evaluation on classification, summarization, and program repair tasks provides initial evidence for these task-dependent effects, thereby reinforcing our theoretical results. This establishes LLM Chemistry as both a diagnostic factor in multi-LLM systems and a foundation for ensemble recommendation.

Problem

Research questions and friction points this paper is trying to address.

Quantifying synergistic or antagonistic behaviors in LLM combinations

Formalizing chemistry notion among LLMs through interaction dependencies

Recommending optimal model ensembles based on measured chemistry

Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantifies synergy and antagonism in LLM combinations

Recommends optimal ensembles based on interaction dependencies

Analyzes chemistry effects across tasks and model profiles

🔎 Similar Papers

Text-Augmented Multimodal LLMs for Chemical Reaction Condition Recommendation