Scholar
Yuqian Fu
Google Scholar ID: oRcXbE0AAAAJ
Institute of Automation, Chinese Academy of Sciences
Reinforcement Learning
Large Language Model
Citations & Impact
All-time
Citations: 55
H-index: 5
i10-index: 1
Publications: 15
Co-authors: 8 (list available)
Publications
1 item
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
2026
Cited: 0
Co-authors
8 total
Dongbin Zhao
Institute of Automation, Chinese Academy of Sciences
Yuanheng Zhu
Institute of Automation, Chinese Academy of Sciences
Jiajun Chai
Meituan Inc.
Guojun Yin
Meituan, University of Science and Technology of China
Xihuai Wang
Shanghai Jiao Tong University
Jian Zhao
Zhongguancun Institute of Artificial Intelligence
Guohao Li
University of Oxford, CAMEL-AI.org
Bernard Ghanem
Professor, King Abdullah University of Science and Technology