- "Collaborative Pure Exploration in Kernel Bandit" (ICLR 2023)
- "Branching Reinforcement Learning" (ICML)
Research Experience
- Postdoc at UIUC from 2023 to 2025, advised by Prof. R. Srikant
- Visited Cornell University in Fall 2022, working with Prof. Wen Sun
- Research intern at MSR Asia from January to May 2020, mentored by Dr. Wei Chen
- Collaborated with industry partners such as Nvidia and Microsoft
Education
- Ph.D. in Computer Science, September 2018 - June 2023, Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, Advisor: Prof. Longbo Huang
- B.E. in Computer Science, September 2014 - June 2018, Xiamen University
Background
- Research interests: machine learning, including reinforcement learning, online learning (particularly bandits), and multi-task learning
- Recent research interests: application of RL and bandits in LLMs (e.g., RLHF and DPO), and diffusion models for decision making
- Tenure-track assistant professor at the ESD pillar of the Singapore University of Technology and Design (SUTD)
Miscellany
- Actively looking for Ph.D. students with full scholarship (Spring or Fall 2026), research interns, and visiting scholars
- Co-mentored two undergraduate students with his Ph.D. advisor, both projects published in top conferences NeurIPS and ICLR (the student is the first author)