Tongyi DeepResearch Technical Report

📅 2025-10-28
📈 Citations: 0
Influential: 0
🤖 AI Summary
This paper introduces Tongyi DeepResearch, an agentic large language model for long-horizon, deep information-seeking research tasks that require end-to-end reasoning, proactive information acquisition, and continual planning. The model is trained with an end-to-end framework that combines agentic mid-training and agentic post-training, supported by a fully automated, scalable data synthesis pipeline that requires no human annotation and by customized environments for each training stage. It uses a sparse-activation architecture with 30.5B total parameters, of which 3.3B are activated per token. The contributions are: (1) an end-to-end training framework tailored to deep research agents; (2) open-sourced model weights, framework, and complete solutions for the community; and (3) state-of-the-art performance on agentic deep research benchmarks, including Humanity's Last Exam and BrowseComp.

📝 Abstract
We present Tongyi DeepResearch, an agentic large language model, which is specifically designed for long-horizon, deep information-seeking research tasks. To incentivize autonomous deep research agency, Tongyi DeepResearch is developed through an end-to-end training framework that combines agentic mid-training and agentic post-training, enabling scalable reasoning and information seeking across complex tasks. We design a highly scalable data synthesis pipeline that is fully automatic, without relying on costly human annotation, and empowers all training stages. By constructing customized environments for each stage, our system enables stable and consistent interactions throughout. Tongyi DeepResearch, featuring 30.5 billion total parameters, with only 3.3 billion activated per token, achieves state-of-the-art performance across a range of agentic deep research benchmarks, including Humanity's Last Exam, BrowseComp, BrowseComp-ZH, WebWalkerQA, xbench-DeepSearch, FRAMES and xbench-DeepSearch-2510. We open-source the model, framework, and complete solutions to empower the community.
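
As a rough illustration of the sparse-activation figures quoted in the abstract, the sketch below computes the share of parameters that is active per token. Only the two parameter counts come from the abstract; the function and constant names are illustrative assumptions, not part of the released framework.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Return the fraction of parameters activated per token."""
    if total_params_b <= 0:
        raise ValueError("total parameter count must be positive")
    return active_params_b / total_params_b


# Parameter counts in billions, as stated in the abstract.
TOTAL_PARAMS_B = 30.5
ACTIVE_PARAMS_B = 3.3

fraction = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
print(f"{fraction:.1%} of parameters are active per token")  # prints 10.8% ...
```

Under these numbers, roughly one parameter in nine participates in each token's forward pass. How that translates into compute or memory cost depends on architectural details not given in the abstract.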
Problem

Research questions and friction points this paper is trying to address.

Developing an agentic LLM for long-horizon deep research tasks
Creating scalable autonomous reasoning across complex information-seeking scenarios
Achieving state-of-the-art performance on agentic deep research benchmarks
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end training framework with agentic stages
Fully automatic scalable data synthesis pipeline
Customized environments for stable agent interactions
Authors

Baixuan Li
Master's Student, Southeast University
Open-Domain QA, Agent Learning, RAG, LLMs, Text Embedding

Bo Zhang
Tongyi Lab, Alibaba Group

Dingchu Zhang
Tongyi Lab, Alibaba Group

Fei Huang
Tongyi Lab, Alibaba Group

Guangyu Li
New York University
Recommendation System, Social Networks, Network Caching System

Guoxin Chen
Tongyi Lab, Alibaba Group

Huifeng Yin
Tongyi Lab, Alibaba Group

Jialong Wu
Tongyi Lab, Alibaba Group

Jingren Zhou
Alibaba Group, Microsoft
Cloud Computing, Large Scale Distributed Systems, Machine Learning, Query Processing, Query

Kuan Li
Hong Kong University of Science and Technology (HKUST)
LLM agent, machine learning on graphs, adversarial robustness

Liangcai Su
The University of Hong Kong, Tsinghua University
Data Mining, Large Language Models, Deep Research Agents

Litu Ou
University of Edinburgh
Natural Language Processing, Machine Learning, Information Retrieval

Liwen Zhang
Tongyi Lab, Alibaba Group

Pengjun Xie
Alibaba Group
NLP/IR/ML

Rui Ye
Tongyi Lab, Alibaba Group

Wenbiao Yin
Tongyi Lab, Alibaba Group
LLM, Agent, RAG

Xinmiao Yu
Tongyi Lab, Alibaba Group

Xinyu Wang
Tongyi Lab, Alibaba Group

Xixi Wu
The Chinese University of Hong Kong
Graph Neural Networks, Data Mining, Large Language Models

Xuanzhong Chen
Tsinghua University
AI for Healthcare, Large Language Models, Machine Learning

Yida Zhao
ShanghaiTech University
Natural Language Processing

Zhen Zhang
Tongyi Lab, Alibaba Group

Zhengwei Tao
Peking University
Agent, Data Synthesis

Zhongwang Zhang
Shanghai Jiao Tong University

Zile Qiao
Alibaba Tongyi Lab; Peking University

Chenxi Wang
Tongyi Lab, Alibaba Group

Donglei Yu
Institute of Automation, Chinese Academy of Sciences
simultaneous machine translation, large language model

Gang Fu
Amazon
Machine Learning, Deep Learning, Semantic Network Analysis

Haiyang Shen
Tongyi Lab, Alibaba Group

Jiayin Yang
Tongyi Lab, Alibaba Group

Jun Lin
Tongyi Lab, Alibaba Group

Junkai Zhang
Tongyi Lab, Alibaba Group

Kuijie Zeng
Tongyi Lab, Alibaba Group

Li Yang
Tongyi Lab, Alibaba Group

Hailong Yin
Tongyi Lab, Alibaba Group

Maojia Song
University of Leeds
Adaptive Intelligence, Natural Language Processing, Multimodal Interaction, Question Answering

Ming Yan
Tongyi Lab, Alibaba Group

Peng Xia
PhD student, Department of Computer Science, UNC Chapel Hill
Multimodal Agent, Healthcare

Qian Xiao
Shanghai Jiao Tong University
Statistics

Rui Min
Hong Kong University of Science and Technology
Machine Learning, Agent, Trustworthy AI

Rui Ding
Principal Researcher, Microsoft
Causal Discovery, Causal Inference, Advanced Data Analysis

Runnan Fang
Zhejiang University
Tool learning, Agent

Shaowei Chen
Tongyi Lab, Alibaba Group

Shen Huang
Director of Search, Yihaodian.com
Machine learning, data mining, search, recommendation, personalization

Shihang Wang
DAMO Academy, Alibaba Inc.
Natural Language Processing

Shihao Cai
University of Science and Technology of China
large language models, recommendation

Weizhou Shen
Tongyi Lab, Alibaba Group

Xiaobin Wang
Tongyi Lab, Alibaba Group

Xin Guan
Research, Holistic AI
Ethical AI and Normative Reasoning

Xinyu Geng
Tongyi Lab, Alibaba Group

Yingcheng Shi
Tongyi Lab, Alibaba Group

Yuning Wu
Wayne State University
perceptions of crime & justice, police attitudes and behaviors, victimization, criminological theories, law and society

Zhuo Chen
Tongyi Lab, Alibaba Group

Zijian Li
Tongyi Lab, Alibaba Group

Yong Jiang
Tongyi Lab, Alibaba Group