Zhiyong Wang

Google Scholar ID: JnT7gacAAAAJ
The University of Edinburgh
Research interests: Reinforcement Learning, Bandits, Online Learning, Machine Learning, Generative AI
Citations & Impact
All-time
Citations: 132
H-index: 7
i10-index: 6
Publications: 20
Co-authors: 37 (list available)
Publications
3 items
Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins. 2026. Cited: 0
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks. 2026. Cited: 0
Workflow-R1: Group Sub-sequence Policy Optimization for Multi-turn Workflow Construction. 2026. Cited: 0
Co-authors
37 total
Jize Xie, The Hong Kong University of Science and Technology
Xutong Liu, Assistant Professor of Computer Science and Systems, University of Washington
Shuai Li (李帅), Shanghai Jiao Tong University
Dongruo Zhou, Indiana University Bloomington
Xiangxiang Dai, The Chinese University of Hong Kong
Jinhang Zuo, Assistant Professor of Computer Science, City University of Hong Kong
Tong Yu, Adobe Research
Zhongxiang Dai, Assistant Professor, The Chinese University of Hong Kong, Shenzhen
