Scholar
Baoxiang Wang
Google Scholar ID: cQe4OeYAAAAJ
Assistant Professor, The Chinese University of Hong Kong Shenzhen
reinforcement learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
684
H-index
15
i10-index
19
Publications
20
Co-authors
10
list available
Contact
No contact links provided.
Publications
19 items
Epistemic Gain, Aleatoric Cost: Uncertainty Decomposition in Multi-Agent Debate for Math Reasoning
2026
Cited
0
Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents
2026
Cited
0
The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL
2026
Cited
0
Trust Region Masking for Long-Horizon LLM Reinforcement Learning
2025
Cited
0
Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning
2025
Cited
0
Policy-Conditioned Policies for Multi-Agent Task Solving
2025
Cited
0
Reinforcement Learning for Target Zone Blood Glucose Control
2025
Cited
0
Bayesian Persuasion as a Bargaining Game
2025
Cited
0
Load more
Resume (English only)
Co-authors
10 total
Shuai Li (李帅)
Shanghai Jiao Tong University
Jing Dong
The Chinese University of Hong Kong, Shenzhen
Hongyuan Zha
The Chinese University of Hong Kong, Shenzhen
Kun Kuang
Zhejiang University
Furui Liu
Zhejiang Lab and UCAS and Zhejiang University
Fei Wu
Professor of Computer Science, Zhejiang University
Jun XIAO (肖俊)
Institute of Artificial Intelligence, Zhejiang University
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up