Baoxiang Wang

Google Scholar ID: cQe4OeYAAAAJ
Assistant Professor, The Chinese University of Hong Kong, Shenzhen
reinforcement learning
Citations & Impact (All-time)
Citations: 684
H-index: 15
i10-index: 19
Publications: 20
Co-authors: 10 (list available)
Contact
No contact links provided.
Publications
19 items
Epistemic Gain, Aleatoric Cost: Uncertainty Decomposition in Multi-Agent Debate for Math Reasoning (2026). Cited: 0
Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents (2026). Cited: 0
The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL (2026). Cited: 0
Trust Region Masking for Long-Horizon LLM Reinforcement Learning (2025). Cited: 0
Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning (2025). Cited: 0
Policy-Conditioned Policies for Multi-Agent Task Solving (2025). Cited: 0
Reinforcement Learning for Target Zone Blood Glucose Control (2025). Cited: 0
Bayesian Persuasion as a Bargaining Game (2025). Cited: 0
Co-authors
10 total
Shuai Li (李帅), Shanghai Jiao Tong University
Jing Dong, The Chinese University of Hong Kong, Shenzhen
Hongyuan Zha, The Chinese University of Hong Kong, Shenzhen
Kun Kuang, Zhejiang University
Furui Liu, Zhejiang Lab and UCAS and Zhejiang University
Fei Wu, Professor of Computer Science, Zhejiang University
Jun XIAO (肖俊), Institute of Artificial Intelligence, Zhejiang University