GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search

📅 2025-10-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In multi-agent systems, errors propagate erroneously across agents during multi-round deep search, hindering root-cause localization. Method: This paper proposes an information-flow–based fault attribution paradigm. Its core innovations are: (1) constructing an Information Dependency Graph (IDG) to explicitly model cross-agent output referencing—replacing temporal sequences and enabling precise discrimination between symptoms and root causes; (2) designing a graph-structure–aware synthetic data generation method to simulate failures at critical nodes; and (3) developing a graph-traversal–based root-cause tracing algorithm, integrated into the multi-agent framework for automated diagnosis. Results: On the Who&When benchmark, the approach achieves up to 18.18% improvement in attribution accuracy; in real-world deployment, it yields performance gains of 4.8%–14.2%.

Technology Category

Application Category

📝 Abstract
Multi-agent systems powered by Large Language Models excel at complex tasks through coordinated collaboration, yet they face high failure rates in multi-turn deep search scenarios. Existing temporal attribution methods struggle to accurately diagnose root causes, particularly when errors propagate across multiple agents. Attempts to automate failure attribution by analyzing action sequences remain ineffective due to their inability to account for information dependencies that span agents. This paper identifies two core challenges: extit{(i) distinguishing symptoms from root causes in multi-agent error propagation}, and extit{(ii) tracing information dependencies beyond temporal order}. To address these issues, we introduce extbf{GraphTracer}, a framework that redefines failure attribution through information flow analysis. GraphTracer constructs Information Dependency Graphs (IDGs) to explicitly capture how agents reference and build on prior outputs. It localizes root causes by tracing through these dependency structures instead of relying on temporal sequences. GraphTracer also uses graph-aware synthetic data generation to target critical nodes, creating realistic failure scenarios. Evaluations on the Who&When benchmark and integration into production systems demonstrate that GraphTracer-8B achieves up to 18.18% higher attribution accuracy compared to state-of-the-art models and enables 4.8% to 14.2% performance improvements in deployed multi-agent frameworks, establishing a robust solution for multi-agent system debugging.
Problem

Research questions and friction points this paper is trying to address.

Diagnosing root causes in multi-agent error propagation
Tracing information dependencies beyond temporal sequences
Improving failure attribution accuracy in multi-turn deep search
Innovation

Methods, ideas, or system contributions that make the work stand out.

Constructs Information Dependency Graphs for explicit capture
Localizes root causes through dependency structures instead of time
Uses graph-aware synthetic data generation for critical nodes
🔎 Similar Papers
No similar papers found.
H
Heng Zhang
South China Normal University, China
Y
Yuling Shi
Shanghai Jiao Tong University, China
Xiaodong Gu
Xiaodong Gu
Associate Professor, Shanghai Jiao Tong University
Software EngineeringLarge Language Models
Haochen You
Haochen You
Columbia University
Generative AIMachine LearningStatistics
Z
Zijian Zhang
University of Pennsylvania, USA
L
Lubin Gan
University of Science and Technology of China, China
Y
Yilei Yuan
University of Michigan, USA
J
Jin Huang
South China Normal University, China