Xuekai Zhu
Scholar

Google Scholar ID: plXXtQkAAAAJ
Shanghai Jiao Tong University
Synthetic Data, Reasoning, Language Model
Citations & Impact
All-time
Citations: 453
H-index: 11
i10-index: 11
Publications: 20
Co-authors: 12 (list available)
Contact
No contact links provided.
Publications
18 items
Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning
2026, Cited: 0
How Far Can Unsupervised RLVR Scale LLM Training?
2026, Cited: 0
Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets
2026, Cited: 0
SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks
2026, Cited: 1
Controlling Exploration-Exploitation in GFlowNets via Markov Chain Perspectives
2026, Cited: 0
FlowRL: Matching Reward Distributions for LLM Reasoning
2025, Cited: 0
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
2025, Cited: 0
A Survey of Reinforcement Learning for Large Reasoning Models
2025, Cited: 0
Co-authors
12 total
Kaiyan Zhang
Tsinghua University
Bowen Zhou
Chair Professor, Department of Electrical Engineering, Tsinghua University; Founder of Frontis.ai
Ning Ding
Assistant Professor, Tsinghua University
Ermo Hua
Tsinghua University
Xingtai Lv
Tsinghua University
Xinwei Long
Tsinghua University
Zhouhan Lin(林洲汉)
Shanghai Jiao Tong University; Mila Lab; Facebook AI Research
Daixuan Cheng
Gaoling School of AI, Renmin University of China