Papers: 'MoLoRAG' accepted by EMNLP'2025, 'LLMNodeBed' accepted by ICML'2025, 'GNN4TaskPlan' accepted by NeurIPS'2024; Awards: NeurIPS'2024 Top Reviewer; Projects: Released WebSailor-V2, achieving SOTA performance on challenging web browsing benchmarks; Released official technical report for Tongyi-DeepResearch and E-GRPO, a novel agentic RL algorithm based on reward shaping.
Research Experience
Research Intern at Tongyi DeepResearch Team, contributing to the development of Tongyi-DeepResearch-30B-A3B.
Education
Ph.D.: The Chinese University of Hong Kong (Advisor: Prof. Hong CHENG); M.S.: Computer Science, Fudan University (Advisor: Prof. Yun XIONG), 2024; B.S.: Computer Science, Fudan University, 2021.
Background
Research Interests: Enhancing long-horizon planning, reasoning, and tool-use capabilities for language agents. Brief: Ph.D. student at The Chinese University of Hong Kong, with previous research in Graph Learning.
Miscellany
Open to summer 2026 research internships in the US.