Scholar
Abe Ittycheriah
Google Scholar ID: 8P1Y_90AAAAJ
Google
NLP
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
12,418
H-index
19
i10-index
27
Publications
20
Co-authors
13
list available
Contact
No contact links provided.
Publications
1 items
RRM: Robust Reward Model Training Mitigates Reward Hacking
arXiv.org · 2024
Cited
4
Resume (English only)
Co-authors
13 total
Co-author 1
Salim Roukos, Salim Roucos
IBM
Haitao Mi
Principal Researcher, Tencent US
Martin Franz
IBM Research AI
Zhiguo Wang
Principal Scientist at AWS AI Labs
Radu Florian
Research Staff Member, IBM
Xiaoqiang Luo
LinkedIn
Bhuvana Ramabhadran
Director/Principal Research Scientist, Google DeepMind
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up