Efficient and Effective Sparse LSTM on FPGA with Bank-Balanced Sparsity, FPGA '19
Balanced Sparsity for Efficient DNN Inference on GPU, AAAI '19
Scheduling CPU for GPU-based Deep Learning Jobs, SoCC '18 poster
Gandiva: Introspective Cluster Scheduling for Deep Learning, OSDI '18
Research Experience
Currently an AI system developer/researcher in the PAI team at Alibaba Group, focusing on building highly efficient deep learning infrastructure; previously spent over 5 years in the system research group at Microsoft Research.
Education
Pursued a Ph.D. in the system research group of Microsoft Research, supervised by Lidong Zhou of Microsoft Research and Prof. Wei Li of Beihang University.
Background
Research interests span a wide range of computer systems areas, including both traditional operating systems topics and modern directions involving heterogeneous hardware and new applications. Focused on providing better system support for large-scale artificial intelligence applications.