His papers have won IEEE 2024 SPS Young Author Best Paper Award, Outstanding Paper Award in NeurIPS 2023, and Best Student Paper Honorable Mention in WACV 2021. Serves as Action Editor for Transactions on Machine Learning Research (TMLR) and ACM Transactions on Intelligent Systems and Technology (TIST). Has served as Senior Area Chair or Area Chair for multiple top conferences including NeurIPS, ICML, etc.
Research Experience
From 2021 to 2023, he was a Principal Researcher at Microsoft Research Redmond, leading several teams to productize these techniques for Microsoft-OpenAI core models (Copilot, DALL-E-2, ChatGPT, GPT-4).
Education
Received his Ph.D. from Northwestern University in 2015 and B.S. from Tsinghua University in 2010.
Background
Currently an Associate Professor in Computer Science and Engineering at the Chinese University of Hong Kong. Also serves as a Lead Scientist at Shanghai AI Lab and a Professor at Shanghai Innovation Institute. Specializes in model compression & efficiency, and large language/multimodality models.
Miscellany
Invited to give talks at various international conferences and workshops, such as NeurIPS 2025, ICLR 2025, etc. Organized the Efficient Natural Language and Speech Processing Workshop at NeurIPS 2024.