Selected Publications: 'The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models', 'Process Reinforcement through Implicit Rewards'; accepted by NeurIPS 2024, ICML 2024, etc.
Honors & Awards: The Most Outstanding Students Award of UESTC (Top 10 among all undergraduates), UESTC-LuZhouLaoJiao Scholarship (10K RMB), etc.
Research Experience
Working closely with Dr. Ganqu Cui and Prof. Yu Cheng on research projects.
Education
PhD student at Peking University (Fall 2025), advised by Prof. Ning Ding and Prof. Bowen Zhou; B.E. from the University of Electronic Science and Technology of China (UESTC), with the Most Outstanding Students Award.
Background
Research Interests: Building large reasoning models in both the digital and physical worlds with scalable and generalizable reinforcement learning methods.
Brief Introduction: Yuchen Zhang is a PhD student dedicated to building large reasoning models in the digital and physical worlds.
Miscellany
Personal Interests: Feel free to contact me if you're interested in related research or would like to discuss potential collaborations!