International Conference on Learning Representations · 2023
Cited: 932
Resume (English only)
Academic Achievements
Publications: 'ReTool: Reinforcement Learning for Strategic Tool Use in LLMs' (arXiv '25), 'UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning' (arXiv '25), 'WizardLM: Empowering LLMs to follow complex instructions' (ICLR '24), 'LoGiPT: Logical reasoning with LLMs' (NAACL '24), 'InteR: LLMs for IR' (ACL '24), etc.
Research Experience
Research Intern at Microsoft Azure AI, advised by Dr. Ruochen Xu, Dr. Yelong Shen, and Dr. Weizhu Chen; Research Intern at Microsoft Research Asia (MSRA).
Education
Ph.D. student at the School of Intelligence Science and Technology, Peking University, advised by Prof. Dongyan Zhao and Prof. Rui Yan; Visiting Student at the University of Oxford, advised by Prof. Yee Whye Teh.
Background
Research Interests: Multimodal agents and machine learning.
Brief Introduction: Currently a Research Scientist at ByteDance Seed and a Visiting Student at the University of Oxford.