JoyAgent-JDGenie: Technical Report on the GAIA

📅 2025-10-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing autonomous agents suffer from limited robustness, poor adaptability, and fragmented design paradigms. Method: This paper proposes a unified agent architecture featuring: (1) a collective multi-agent collaboration framework with a critic-model voting mechanism to enhance decision reliability; (2) a three-tier hierarchical memory system—comprising working, semantic, and procedural memory—to enable dynamic memory management and cross-task knowledge reuse; and (3) integrated planning-execution dual agents, tool augmentation (web search, code execution, multimodal parsing), and modular system fusion. Contribution/Results: Evaluated on comprehensive benchmarks, the architecture consistently outperforms leading open-source baselines and approaches the performance of commercial closed-source systems, demonstrating strong generalization across domains, scalability, and robust adaptive capability.

Technology Category

Application Category

📝 Abstract
Large Language Models are increasingly deployed as autonomous agents for complex real-world tasks, yet existing systems often focus on isolated improvements without a unifying design for robustness and adaptability. We propose a generalist agent architecture that integrates three core components: a collective multi-agent framework combining planning and execution agents with critic model voting, a hierarchical memory system spanning working, semantic, and procedural layers, and a refined tool suite for search, code execution, and multimodal parsing. Evaluated on a comprehensive benchmark, our framework consistently outperforms open-source baselines and approaches the performance of proprietary systems. These results demonstrate the importance of system-level integration and highlight a path toward scalable, resilient, and adaptive AI assistants capable of operating across diverse domains and tasks.
Problem

Research questions and friction points this paper is trying to address.

Developing robust autonomous agents for complex real-world tasks
Integrating multi-agent planning with hierarchical memory systems
Creating scalable AI assistants adaptable across diverse domains
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent framework with planning and critic voting
Hierarchical memory system across multiple layers
Refined tool suite for search and code execution
J
Jiarun Liu
S
Shiyue Xu
S
Shangkun Liu
Y
Yang Li
W
Wen Liu
M
Min Liu
X
Xiaoqing Zhou
H
Hanmin Wang
S
Shilin Jia
Z
zhen Wang
S
Shaohua Tian
Hanhao Li
Hanhao Li
香港中文大学
J
Junbo Zhang
Y
Yongli Yu
P
Peng Cao
Haofen Wang
Haofen Wang
Tongji University
Knowledge GraphNatural Language ProcessingRetrieval Augmented Generation