Scholar
Zhenrong Zhang
Google Scholar ID: 1PQFwPAAAAAJ
University of Science and Technology of China
Large Language Model
Reinforcement Learning
Document Understanding
Table Recognition
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
249
H-index
7
i10-index
6
Publications
20
Co-authors
15
list available
Contact
No contact links provided.
Publications
1 items
Step Potential Advantage Estimation: Harnessing Intermediate Confidence and Correctness for Efficient Mathematical Reasoning
arXiv.org · 2026
Cited
0
Resume (English only)
Co-authors
15 total
Jun Du
Professor, NERC-SLIP, USTC
Jianshu Zhang
iFLYTEK Research
Jiefeng Ma
USTC
Pengfei Hu
University of Science and Technology of China
Chenyu Liu
iFLYTEK Research, USTC
Qikai Chang
University of Science and Technology of China
Qing Wang
Associate Researcher, University of Science and Technology of China
yicheng pan
中国科学技术大学
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up