Yuki Ichihara
Google Scholar ID: qexbctUAAAAJ
Nara Institute of Science and Technology
Reinforcement Learning
NLP
Citations & Impact
All-time
Citations: 18
H-index: 2
i10-index: 1
Publications: 5
Co-authors: 0
Contact
No contact links provided.
Publications
4 items
Consensus Group Relative Policy Optimization for Text Generation (2026). Cited: 0
MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems (2025). Cited: 0
Theoretical Guarantees for Minimum Bayes Risk Decoding (2025). Cited: 0
Evaluation of Best-of-N Sampling Strategies for Language Model Alignment (2025). Cited: 0
Resume (English only)
Co-authors
0 total