Published several papers, including 'POLARIS: A Post-Training Recipe for Scaling Reinforcement Learning on Advanced Reasoning Models' and 'L-Eval: Instituting Standardized Evaluation for Long Context Language Models'. Won the Outstanding Paper Award at ACL 2024. Also awarded a fellowship under the Hong Kong PhD Fellowship Scheme (HKPFS) and the National Scholarship at Fudan University.
Research Experience
Engaged in multiple research projects during the Ph.D. program, concentrating on the evaluation and improvement of long-context language models.
Education
Received bachelor’s and master’s degrees from the Department of Computer Science at Fudan University, advised by Prof. Xipeng Qiu. Currently a Ph.D. candidate at HKU, supervised by Prof. Lingpeng Kong.
Background
A second-year Ph.D. candidate at HKU, focusing on building effective long-context LLMs and scaling reinforcement learning with LLMs.