International Conference on Machine Learning · 2024
Cited
1
Resume (English only)
Academic Achievements
Published several papers, including preprints and conference papers, such as 'Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference', 'Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation', and more.
Research Experience
Currently pursuing a Ph.D. at the School of Artificial Intelligence, Nanjing University, involved in various research projects related to reinforcement learning.
Education
Ph.D. student at the School of Artificial Intelligence, Nanjing University, advised by Professor Yang Yu and Associate Professor Zongzhang Zhang, and a member of the LAMDA Group led by Professor Zhi-Hua Zhou.
Background
Research interests include theoretically justified algorithms and real-world applications of Reinforcement Learning (RL), particularly in reward model learning, offline RL and Sim2Real transfer, decision making in non-stationary environments. Also interested in decision making via Large Language Models (LLMs).
Miscellany
Feel free to discuss or collaborate, email: liyc@lamda.nju.edu.cn