Scholar

Bilal Piot

Google Scholar ID: fqxNUREAAAAJ

Google Deepmind

reinforcement learninginverse reinforcement learning

Google Scholar↗

Citations & Impact

All-time

Citations

23,271

H-index

41

i10-index

57

Publications

20

Co-authors

102

list available

Contact

No contact links provided.

Publications

4 items

Gemma 3 Technical Report

2025

Cited

0

Learning from negative feedback, or positive feedback or both

2024

Cited

1

RRM: Robust Reward Model Training Mitigates Reward Hacking

arXiv.org · 2024

Cited

4

Building Math Agents with Multi-Turn Iterative Preference Learning

arXiv.org · 2024

Cited

18

Resume (English only)

Co-authors

102 total

Olivier Pietquin

Earth Species Project | ex Google DeepMind (On leave - Professor at University of Lille)

Mohammad Gheshlaghi Azar

Zhaohan Daniel Guo

Jean-bastien Grill

Unknown affiliation

Corentin Tallec