Scholar
Jimmy Ba
Google Scholar ID: ymzxRhAAAAAJ
University of Toronto
Neural Networks
Artificial Intelligence
Machine Learning
Deep Learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
287,794
H-index
64
i10-index
98
Publications
20
Co-authors
12
list available
Contact
No contact links provided.
Publications
1 items
EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL
2026
Cited
0
Resume (English only)
Co-authors
12 total
Diederik P. Kingma
Anthropic
Ruslan Salakhutdinov
UPMC Professor, Machine Learning Department, CMU
Yoshua Bengio
Professor of computer science, University of Montreal, Mila, IVADO, CIFAR
Roger Grosse
Associate Professor, University of Toronto
Geoffrey Hinton
Emeritus Prof. Computer Science, University of Toronto
Co-author 6
Sanja Fidler
University of Toronto, NVIDIA
Kevin Swersky
Google Brain
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up