Scholar
Zhiyong Wang
Google Scholar ID: JnT7gacAAAAJ
The University of Edinburgh
Reinforcement Learning
Bandits
Online Learning
Machine Learning
Generative AI
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
132
H-index
7
i10-index
6
Publications
20
Co-authors
37
list available
Contact
No contact links provided.
Publications
3 items
Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins
2026
Cited
0
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
2026
Cited
0
Workflow-R1: Group Sub-sequence Policy Optimization for Multi-turn Workflow Construction
2026
Cited
0
Resume (English only)
Co-authors
37 total
Jize Xie
The Hong Kong University of Science and Technology
Xutong Liu
Assistant Professor of Computer Science and Systems, University of Washington
Shuai Li (李帅)
Shanghai Jiao Tong University
Dongruo Zhou
Indiana University Bloomington
Xiangxiang Dai
The Chinese University of Hong Kong
Jinhang Zuo
Assistant Professor of Computer Science, City University of Hong Kong
Tong Yu
Adobe Research
Zhongxiang Dai
Assistant Professor, The Chinese University of Hong Kong, Shenzhen
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up