Xuekai Zhu
Google Scholar ID: plXXtQkAAAAJ
Shanghai Jiao Tong University
Research interests: Synthetic Data, Reasoning, Language Model
Citations & Impact (all-time)
Citations: 453
H-index: 11
i10-index: 11
Publications: 20
Co-authors: 12 (list available)
Publications (18 items)
Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning (2026). Cited: 0
How Far Can Unsupervised RLVR Scale LLM Training? (2026). Cited: 0
Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets (2026). Cited: 0
SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks (2026). Cited: 1
Controlling Exploration-Exploitation in GFlowNets via Markov Chain Perspectives (2026). Cited: 0
FlowRL: Matching Reward Distributions for LLM Reasoning (2025). Cited: 0
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning (2025). Cited: 0
A Survey of Reinforcement Learning for Large Reasoning Models (2025). Cited: 0
Co-authors (12 total)
Kaiyan Zhang: Tsinghua University
Bowen Zhou: Chair Professor, Department of Electrical Engineering, Tsinghua University; Founder of Frontis.ai
Ning Ding: Assistant Professor, Tsinghua University
Ermo Hua: Tsinghua University
Xingtai Lv: Tsinghua University
Xinwei Long: Tsinghua University
Zhouhan Lin (林洲汉): Shanghai Jiao Tong University; Mila Lab; Facebook AI Research
Daixuan Cheng: Gaoling School of AI, Renmin University of China