
Penghui Qi

Google Scholar ID: CLRsGEMAAAAJ
Sea AI Lab; PhD student at NUS
Machine Learning · Reinforcement Learning · MLSys
Citations & Impact
All-time
Citations: 526
H-index: 6
i10-index: 4
Publications: 9
Co-authors: 8 (list available)
Contact
No contact links provided.
Publications
8 items
Rethinking the Trust Region in LLM Reinforcement Learning · 2026 · Cited 1
Revisiting Parameter Server in LLM Post-Training · 2026 · Cited 1
Defeating the Training-Inference Mismatch via FP16 · 2025 · Cited 0
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning · 2025 · Cited 0
Optimizing Anytime Reasoning via Budget Relative Policy Optimization · 2025 · Cited 0
Understanding R1-Zero-Like Training: A Critical Perspective · 2025 · Cited 0
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization · 2025 · Cited 0
Balancing Pipeline Parallelism with Vocabulary Parallelism · arXiv.org · 2024 · Cited 2
Co-authors
8 total
Min Lin · Principal Research Scientist, Sea AI Lab
Zichen Liu · Sea AI Lab; National University of Singapore
Tianyu Pang · Senior Research Scientist, Sea AI Lab
Chao Du · Senior Research Scientist, Sea AI Lab
Wee Sun Lee · Professor, Department of Computer Science, National University of Singapore
Xinyi Wan · Sea AI Lab
Junxiao Song · DeepSeek AI
Co-author 8
