Scholar
Penghui Qi
Google Scholar ID: CLRsGEMAAAAJ
Sea AI Lab & PhD student of NUS
Machine Learning
Reinforcement Learning
MLSys
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
526
H-index
6
i10-index
4
Publications
9
Co-authors
8
list available
Contact
No contact links provided.
Publications
8 items
Rethinking the Trust Region in LLM Reinforcement Learning
2026
Cited
1
Revisiting Parameter Server in LLM Post-Training
2026
Cited
1
Defeating the Training-Inference Mismatch via FP16
2025
Cited
0
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
2025
Cited
0
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
2025
Cited
0
Understanding R1-Zero-Like Training: A Critical Perspective
2025
Cited
0
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
2025
Cited
0
Balancing Pipeline Parallelism with Vocabulary Parallelism
arXiv.org · 2024
Cited
2
Resume (English only)
Co-authors
8 total
Min Lin
Principal Research Scientist, Sea AI Lab
Zichen Liu
Sea AI Lab; National University of Singapore
Tianyu Pang
Senior Research Scientist, Sea AI Lab
Chao Du
Senior Research Scientist, Sea AI Lab
Wee Sun Lee
Professor, Department of Computer Science, National University of Singapore
Xinyi Wan
Sea AI Lab
Junxiao Song
DeepSeek AI
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up