Scholar
Heyang Zhao
Google Scholar ID: zHQ1ap0AAAAJ
UCLA
Machine Learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
238
H-index
8
i10-index
7
Publications
14
Co-authors
15
list available
Contact
No contact links provided.
Publications
7 items
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
2026
Cited
0
Best-of-Majority: Minimax-Optimal Strategy for Pass@$k$ Inference Scaling
2025
Cited
0
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration
2025
Cited
0
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
2025
Cited
0
Nearly Optimal Sample Complexity of Offline KL-Regularized Contextual Bandits under Single-Policy Concentrability
2025
Cited
0
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
arXiv.org · 2024
Cited
1
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
arXiv.org · 2023
Cited
11
Resume (English only)
Co-authors
15 total
Quanquan Gu
Associate Professor of Computer Science, UCLA
Jiafan He
PhD student, Department of Computer Science, UCLA
Dongruo Zhou
Indiana University Bloomington
Tong Zhang
UIUC
Co-author 5
XUHENG LI
Department of Computer Science, University of California, Los Angeles
Chenlu Ye
Computer Science, University of Illinois Urbana-Champaign
Farzad Farnoud
University of Virginia
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up