Publications: 'Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference', 51st International Symposium on Computer Architecture (ISCA), Buenos Aires, Argentina, June 2024; 'Tutel: Adaptive Mixture-of-Experts at Scale', 6th Conference on Machine Learning and Systems (MLSys), Miami, FL, June 2023; 'ARK: GPU-driven Code Execution for Distributed Deep Learning', 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Boston, MA, April 2023; 'Elastic Resource Sharing for Distributed Deep Learning', 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Virtual Event, April 2021; 'Confident Multiple Choice Learning', 34th International Conference on Machine Learning (ICML), Sydney, Australia, August 2017; 'APUNet: Revitalizing GPU as Packet Processing Accelerator', 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI), Boston, MA, March 2017; Preprints: 'MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications', arXiv, April 2025; 'Alchemist: Towards the Design of Efficient Online Continual Learning System', arXiv, March 2025; Workshops & Posters: 'Immediate Communication for Distributed AI Tasks', 2nd Workshop on Hot Topics in System Infrastructure (HotInfra), Austin, TX, November 2024; 'Towards GPU-driven Code Execution for Distributed Deep Learning' (Awarded Best Paper), 3rd Machine Learning for Computer Architecture and Systems (MLArchSys@ISCA), New York City, NY, June 2022; 'Accelerating GNN Training with Locality-Aware Partial Execution' (Awarded Best Paper), 12th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys), Virtual Event, August 2021.
Research Experience
Microsoft Research, Vancouver, BC, Canada, Senior Researcher, Jan 2024 — Present; Microsoft Research, Beijing, China, Senior Researcher, Networking Infrastructure Group, Dec 2023 — Dec 2023; Researcher 2, Networking Infrastructure Group, Mar 2022 — Nov 2023; Research Intern, Networking Research Group, Jul 2019 — Sep 2019; Dec 2018 — Feb 2019.
Education
Ph.D. (Integrated M.S.-Ph.D. Program), Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Feb 2017 — Feb 2022, Advisor: Prof. KyoungSoo Park; M.S. student, Electrical Engineering, KAIST, Feb 2016 — Jan 2017; B.S., Electrical Engineering (major) and Computer Science (minor), KAIST, Feb 2012 — Jan 2016.
Background
Research Interest: Artificial Intelligence, scalable networked systems, system performance optimization, GPU systems. Profile: Senior Researcher at Microsoft, studying scalable AI and large-scale GPU systems.