Published several papers and participated in important projects such as PanGu LLMs optimization. Received multiple scholarships, including National Scholarship (Oct. 2018), Schlumberger Master Scholarship (Oct. 2017), University of Chinese Academy of Sciences Scholarship (Oct. 2016), and First Prize Scholarship from Shandong University (Oct. 2013).
Research Experience
Researcher @ Huawei Noah Lab, June 2019 - Current, working on deep learning performance optimization, including edge and cloud device optimization; Intern/Project Cooperation @ Sugon, Mar. 2018 - June 2019, responsible for linear system benchmark HPL optimization, proposed HHPL algorithm; Intern @ TensorStack (Beijing YunGe Technology), Mar. 2017 - Aug. 2017, mainly responsible for distributed data preprocessing and graph computing; Intern @ Torray Networks, Oct. 2015 - Aug. 2016, conducted high-performance computing optimization, using NVIDIA GPU to accelerate cryo-EM image reconstruction program Relion.
Education
Master, Computer System Architecture, Institute of Computing Technology, Chinese Academy of Sciences, 2016-2019, Advisor: Prof. Guangming Tan, Thesis: Design and Implementation of Large Scale HPL Algorithm for Exascale Computing; Bachelor, Computer Science and Technology, Shandong University, 2012-2016, Advisor: Prof. Weiguo Liu (2017 Gordon Bell Prize Winner), Thesis: Using heterogeneous high-performance computing unified framework OpenCL to accelerate biological big data calculation.
Background
Interests: Heterogeneous Computing, Distributed Computing, Deep Learning Performance Optimization, TOP500 HPL Heterogeneous Optimization, High-Performance Math Library Optimization, Graph Computing, etc.