Scholar
Jiacai Liu
Google Scholar ID: zt9Jfh4AAAAJ
Fudan University
reinforcement learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
325
H-index
6
i10-index
5
Publications
12
Co-authors
5
list available
Contact
No contact links provided.
Publications
10 items
The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL
2026
Cited
0
Beyond Precision: Training-Inference Mismatch is an Optimization Problem and Simple LR Scheduling Fixes It
2026
Cited
1
Trust Region Masking for Long-Horizon LLM Reinforcement Learning
2025
Cited
0
A Note on Hybrid Online Reinforcement and Imitation Learning for LLMs: Formulations and Algorithms
2025
Cited
0
Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning
2025
Cited
0
On the Convergence of Policy Mirror Descent with Temporal Difference Evaluation
2025
Cited
0
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents
2025
Cited
0
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
2025
Cited
0
Load more
Resume (English only)
Co-authors
5 total
Chaojie Wang
AI Researcher, Skywork AI
Liang Zeng
Skywork AI; IIIS, Tsinghua University
Chris Yuhao Liu
University of California, Santa Cruz
Jiawei Wang
University of Science and Technology of China
Yuqian Fu
Institute of Automation,Chinese Academy of Sciences
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up