Scholar
Jiaxuan Gao
Google Scholar ID: UHSwL-wAAAAJ
Institute for Interdisciplinary Information Sciences, Tsinghua University
multi-agent reinforcement learning
large language model
Citations & Impact
All-time
Citations: 2,795
H-index: 9
i10-index: 9
Publications: 20
Co-authors: 3 (list available)
Publications
14 items
Verifiable Process Rewards for Agentic Reasoning (2026). Cited: 0
Sword: Style-Robust World Models as Simulators via Dynamic Latent Bootstrapping for VLA Policy Post-Training (2026). Cited: 0
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation (2026). Cited: 0
AREAL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models (2026). Cited: 0
From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents (2026). Cited: 0
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn (2025). Cited: 0
AReaL-Hex: Accommodating Asynchronous RL Training over Heterogeneous GPUs (2025). Cited: 0
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL (2025). Cited: 0
Co-authors
3 total
Yi Wu
Institute for Interdisciplinary Information Sciences, Tsinghua University
Chao Yu(于超)
Tsinghua University
Shusheng Xu
IIIS, Tsinghua University