Scholar
Yuu Jinnai
Google Scholar ID: H0MaUNIAAAAJ
CyberAgent, Inc.
Artificial Intelligence
Machine Learning
Reinforcement Learning
Heuristic Search
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
737
H-index
11
i10-index
11
Publications
20
Co-authors
15
list available
Contact
CV
Open ↗
GitHub
Open ↗
Publications
9 items
Consensus Group Relative Policy Optimization for Text Generation
2026
Cited
0
Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition
2025
Cited
0
MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems
2025
Cited
0
Do Large Language Models Know Folktales? A Case Study of Yokai in Japanese Folktales
2025
Cited
0
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport
2025
Cited
0
Theoretical Guarantees for Minimum Bayes Risk Decoding
2025
Cited
0
Evaluation of Best-of-N Sampling Strategies for Language Model Alignment
2025
Cited
0
Annotation-Efficient Preference Optimization for Language Model Alignment
arXiv.org · 2024
Cited
2
Load more
Resume (English only)
Co-authors
15 total
David Abel
DeepMind / University of Edinburgh
Tetsuro Morimura
CyberAgent, Inc.
George Konidaris
Brown
Michael Littman
Brown University
Alex Fukunaga
The University of Tokyo
Kaito Ariu
Research Scientist, CyberAgent / kaitoariu@gmail.com
Kenshi Abe
CyberAgent, Inc.
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up