Scholar
Zhuohao Yu
Google Scholar ID: zVYE7-UAAAAJ
Peking University
Natural Language Processing
Software Engineering
Citations & Impact
All-time
Citations
554
H-index
10
i10-index
10
Publications
14
Co-authors
12
list available
Publications
11 items
From Curiosity to Caution: Mitigating Reward Hacking for Best-of-N with Pessimism
2026
Cited
0
SteerRM: Debiasing Reward Models via Sparse Autoencoders
2026
Cited
0
Back to Blackwell: Closing the Loop on Intransitivity in Multi-Objective Preference Fine-Tuning
2026
Cited
0
What Do Agents Learn from Trajectory-SFT: Semantics or Interfaces?
2026
Cited
0
TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them
2025
Cited
0
Benchmarking and Studying the LLM-based Code Review
2025
Cited
0
SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling
2025
Cited
0
RewardAnything: Generalizable Principle-Following Reward Models
2025
Cited
0
Co-authors
12 total
Wei Ye
Peking University
Shikun Zhang
Peking University
Yidong Wang (王一栋)
Ph.D. candidate @ PKU | M.Eng. @ TokyoTech | B.S. @ NJU
Jindong Wang
Assistant Professor, William & Mary; Ex Senior Researcher, Microsoft Research
Zhengran Zeng
Peking University
Yue Zhang
Westlake University
Xing Xie 谢幸
Partner Research Manager, Microsoft, ACM Fellow, IEEE Fellow, CCF Fellow
Ji-Rong Wen
Gaoling School of Artificial Intelligence, Renmin University of China