- July 22, 2025: Paper “Mast: Efficient Training of Mixture-of-Experts Transformers with Task Pipelining and Ordering” received the Distinguished Paper Award at IEEE ICDCS 2025
- June 30, 2025: Honored with the Huawei Best Technology Collaboration Award
- April 30, 2025: Two papers (ScheInfer and SQ-DeAR) accepted by Euro-Par 2025
- March 27, 2025: Two papers accepted by ICDCS 2025
- December 6, 2024: Paper “SP-MoE: Expediting Mixture-of-Experts Training with Optimized Pipelining Planning” accepted by INFOCOM 2025
- October 3, 2024: Paper “FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models” accepted by ASPLOS 2025
- September 21, 2024: Named one of the Outstanding Contributors to Ascend Research Innovation
- September 16, 2024: Ranked among the World’s Top 2% Scientists for 2024 by Stanford University
- June 11, 2024: Two papers accepted by ICPP 2024
- April 15, 2024: Paper “Scheduling Deep Learning Jobs in Multi-Tenant GPU Clusters via Wise Resource Sharing” accepted by IEEE/ACM IWQoS 2024
- January 16, 2024: Paper “FedImpro: Measuring and Improving Client Update in Federated Learning” accepted by ICLR 2024
- January 10, 2024: Paper “ScheMoE: An Extensible Mixture-of-Experts Distributed Training System with Tasks Scheduling” accepted by EuroSys 2024
Research Experience
- October 2023 - Present: Professor, HITSZ
- September 2022 - September 2023: Assistant Professor, HITSZ
- September 2020 - August 2022: Research Assistant Professor, HKUST
- April 2019 - May 2020: Deep Learning Intern, NVIDIA
- February 2014 - March 2016: Senior Research Assistant, Hong Kong Baptist University
- February 2013 - February 2014: Research Assistant, Hong Kong Baptist University
Education
- Ph.D. in Computer Science, Hong Kong Baptist University, 2020
- M.Sc. in Computer Science, Harbin Institute of Technology, Shenzhen, 2013
- B.Eng. in Software Engineering, South China University of Technology, 2010
Background
Currently a Professor at the School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen. Research interests include distributed machine learning systems, GPU computing, parallel and distributed systems, and deep learning.