Scholar
Rosie Zhao
Google Scholar ID: rgwbR6wAAAAJ
Harvard University
reinforcement learning
large language models
optimization
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
482
H-index
9
i10-index
9
Publications
20
Co-authors
0
Contact
No contact links provided.
Publications
8 items
Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models
2026
Cited
0
GenAI for Systems: Recurring Challenges and Design Principles from Software to Silicon
2026
Cited
0
On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs
2026
Cited
0
Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
2025
Cited
0
Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining
2025
Cited
0
Distributional Scaling Laws for Emergent Capabilities
2025
Cited
0
SOAP: Improving and Stabilizing Shampoo using Adam
arXiv.org · 2024
Cited
8
Deconstructing What Makes a Good Optimizer for Language Models
arXiv.org · 2024
Cited
15
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up