Released ROLL, Alibaba’s optimized framework for large-scale reinforcement learning
Associate Editor of IEEE Transactions on Parallel and Distributed Systems
TPC member for IEEE ICNP 2025 and IEEE HiPC 2025
Background
Associate Professor, Department of Computer Science and Engineering, The Hong Kong University of Science and Technology
Associate Director, HKUST Big Data Institute
Associate Director, HKUST-Alibaba Joint Lab on Big Data and AI
Research interests span networking and distributed systems, with a special focus on distributed machine learning systems (MLSys) and AI cloud infrastructure
Focuses on identifying fundamental system design issues and optimization opportunities for emerging AI workloads (e.g., LLM pre-training, reinforcement learning, generative inference) on large-scale cloud infrastructure, seeking generally applicable, efficient, and easily implementable solutions