Received several honors, including Best Paper Awards at QCE and ICML RL4RL, Best Paper Candidate at DATE, ACM SRC 1st Place, Best Poster at the NSF AI Institute, and fellowships from Qualcomm and the Unitary Fund. He was named a Rising Star in both ML & Systems and ISSCC, and was a finalist for the NVIDIA Fellowship. His research, such as SpAtten, is highly cited, being the most cited HPCA paper since 2020.
Research Experience
Made significant contributions in the field of efficient generative AI, particularly the SpAtten framework, which has been widely adopted in both academia and industry. He also developed WorkForce-Agent-R1, an RL-trained LLM web agent that boosts reasoning capability for enterprise automation.
Education
Received his Ph.D. in CS from MIT, advised by Prof. Song Han; B.Eng. with highest honors from Fudan University.
Background
His research focuses on efficient AI computing and computer architecture. His work centers around optimizing Transformer and large language models (LLMs), developing techniques such as SpAtten and Hardware-Aware Transformers.
Miscellany
Co-founded the QuCS Forum to promote AI education.