Scholar
Kunhao Zheng
Google Scholar ID: zDy4jSYAAAAJ
Meta FAIR
Code Generation
Reasoning
Reinforcement Learning
Large Language Model
Theorem Proving
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,188
H-index
8
i10-index
7
Publications
16
Co-authors
33
list available
Contact
No contact links provided.
Publications
9 items
WybeCoder: Verified Imperative Code Generation
2026
Cited
0
CWM: An Open-Weights LLM for Research on Code Generation with World Models
2025
Cited
0
Improving Diversity in Language Models: When Temperature Fails, Change the Loss
2025
Cited
0
Optimizing Language Models for Inference Time Objectives using Reinforcement Learning
2025
Cited
0
The KoLMogorov Test: Compression by Code Generation
2025
Cited
0
Soft Policy Optimization: Online Off-Policy RL for Sequence Models
2025
Cited
0
PILAF: Optimal Human Preference Sampling for Reward Modeling
2025
Cited
0
What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
arXiv.org · 2024
Cited
8
Load more
Resume (English only)
Co-authors
33 total
Ya Zhang
Shanghai Jiao Tong University
Chen Ju
Alibaba Group, Shanghai Jiao Tong University
Co-author 3
Co-author 4
Gabriel Synnaeve
Research scientist at Facebook AI Research
Taco Cohen
Qualcomm AI Research
Weidi Xie
Shanghai Jiao Tong University | VGG, University of Oxford
Jonas Gehring
Facebook AI Research
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up