Scholar

Yuki Ichihara

Google Scholar ID: qexbctUAAAAJ

Nara Institute of Science and Technology

Reinforcement LearningNLP

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

18

H-index

2

i10-index

1

Publications

5

Co-authors

0

Contact

No contact links provided.

Publications

6 items

Reliable Chain-of-Thought via Prefix Consistency

2026

Cited

0

CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency

2026

Cited

0

Consensus Group Relative Policy Optimization for Text Generation

2026

Cited

0

MO-GRPO: Mitigating Reward Hacking of Group Relative Policy Optimization on Multi-Objective Problems

2025

Cited

0

Theoretical Guarantees for Minimum Bayes Risk Decoding

2025

Cited

0

Evaluation of Best-of-N Sampling Strategies for Language Model Alignment

2025

Cited

0

Resume (English only)

Co-authors

0 total

Co-authors: 0 (list not available)