Scholar
Zhenpeng Su
Google Scholar ID: KVOZPeIAAAAJ
Chinese Academy of Sciences; Kuaishou
Mixture-of-Experts
Reinforcement Learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
61
H-index
4
i10-index
1
Publications
15
Co-authors
11
list available
Contact
No contact links provided.
Publications
1 items
Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning
2026
Cited
0
Resume (English only)
Co-authors
11 total
Zijia Lin
Tsinghua University
Xing Wu
Xiaohongshu Inc & UCAS
Guangyuan Ma
Chinese Academy of Sciences
Yizhe Xiong
Tsinghua University
Haoran Lian
Beihang University
Yuntao Li
Peking University
Sirui Wang
Meituan
songlin hu
Unknown affiliation
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up