Scholar

Rosie Zhao

Google Scholar ID: rgwbR6wAAAAJ

Harvard University

reinforcement learninglarge language modelsoptimization

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

482

H-index

9

i10-index

9

Publications

20

Co-authors

0

Contact

No contact links provided.

Publications

8 items

Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models

2026

Cited

0

GenAI for Systems: Recurring Challenges and Design Principles from Software to Silicon

2026

Cited

0

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

2026

Cited

0

Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

2025

Cited

0

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining

2025

Cited

0

Distributional Scaling Laws for Emergent Capabilities

2025

Cited

0

SOAP: Improving and Stabilizing Shampoo using Adam

arXiv.org · 2024

Cited

8

Deconstructing What Makes a Good Optimizer for Language Models

arXiv.org · 2024

Cited

15

Resume (English only)

Co-authors

0 total

Co-authors: 0 (list not available)