Scholar
Bilal Piot
Google Scholar ID: fqxNUREAAAAJ
Google Deepmind
reinforcement learning
inverse reinforcement learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
23,271
H-index
41
i10-index
57
Publications
20
Co-authors
102
list available
Contact
No contact links provided.
Publications
4 items
Gemma 3 Technical Report
2025
Cited
0
Learning from negative feedback, or positive feedback or both
2024
Cited
1
RRM: Robust Reward Model Training Mitigates Reward Hacking
arXiv.org · 2024
Cited
4
Building Math Agents with Multi-Turn Iterative Preference Learning
arXiv.org · 2024
Cited
18
Resume (English only)
Co-authors
102 total
Olivier Pietquin
Earth Species Project | ex Google DeepMind (On leave - Professor at University of Lille)
Mohammad Gheshlaghi Azar
Cohere
Rémi Munos
FAIR, Meta
Zhaohan Daniel Guo
DeepMind
Co-author 5
Jean-bastien Grill
Unknown affiliation
Florian STRUB
Cohere
Corentin Tallec
DeepMind
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up