Yi-Chen Li

Google Scholar ID: OA3GmbQAAAAJ

Nanjing University

Reinforcement Learning, Imitation Learning, RLHF

Citations & Impact

Citations (all-time): 129

Contact

Email: liyc@lamda.nju.edu.cn

Publications

10 items

REAR: Test-time Preference Realignment through Reward Decomposition

2026

Provably Efficient Policy-Reward Co-Pretraining for Adversarial Imitation Learning

2026

RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences

2026

Off-Policy Value-Based Reinforcement Learning for Large Language Models

2026

Non-Adversarial Imitation Learning Provably Free of Compounding Errors: The Role of Bellman Constraints

2026

Generalist Reward Models: Found Inside Large Language Models

2025

Controlling Large Language Model with Latent Actions

2025

Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference

2025

Resume

Academic Achievements

Has published preprints and conference papers, including 'Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference' and 'Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation'.

Research Experience

Currently pursuing a Ph.D. at the School of Artificial Intelligence, Nanjing University, working on research projects in reinforcement learning.

Education

Ph.D. student at the School of Artificial Intelligence, Nanjing University, advised by Professor Yang Yu and Associate Professor Zongzhang Zhang, and a member of the LAMDA Group led by Professor Zhi-Hua Zhou.

Background

Research interests include theoretically justified algorithms and real-world applications of Reinforcement Learning (RL), particularly reward model learning, offline RL and Sim2Real transfer, and decision making in non-stationary environments. Also interested in decision making via Large Language Models (LLMs).

Miscellany

Feel free to get in touch to discuss or collaborate. Email: liyc@lamda.nju.edu.cn
