Publications: "rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking", released on arXiv.
Background
Research Interests: LLM Reasoning, Post-Training, Reward Modeling.
Currently pursuing an M.S. degree in Computer Technology at the School of Computer Science, Peking University.