[{'Paper': 'Optimizing SLO-oriented LLM Serving with PD-Multiplexing', 'Authors': 'Weihao Cui^, Yukang Chen^, Han Zhao^, Ziyi Xu, Quan Chen, Xusheng Chen, Yangjie Zhou, Shixuan Sun, Minyi Guo', 'Status': 'Preprint'}, {'Paper': 'Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution', 'Authors': 'Shulai Zhang, Ao Xu, Quan Chen, Han Zhao, Weihao Cui, Ningxin Zheng, Minyi Guo', 'Status': 'Preprint'}, {'Paper': 'Xputimer: Anomaly diagnostics for divergent llm training in gpu clusters of thousand-plus scale', 'Authors': 'Weihao Cui, Ji Zhang, Han Zhao*, Chao Liu, Wenhao Zhang, Jian Sha, Quan Chen, Bingsheng He, Minyi Guo', 'Status': 'Preprint'}, {'Paper': 'Efficient Function-as-a-Service for Large Language Models with TIDAL', 'Authors': 'Weihao Cui, Ziyi Xu, Han Zhao, Quan Chen, Zijun Li, Bingsheng He, Minyi Guo', 'Status': 'Preprint'}, {'Paper': 'A codesign of scheduling and parallelization for large model training in heterogeneous clusters', 'Authors': 'Chunyu Xue, Weihao Cui, Han Zhao, Quan Chen, Shulai Zhang, Pengyu Yang, Jing Yang, Shaobo Li, Minyi Guo', 'Status': 'Preprint'}, {'Paper': 'LEGO: Supporting LLM-enhanced Games with One Gaming GPU', 'Authors': 'Han Zhao^, Weihao Cui^, Zeshen Zhang, Wenhao Zhang, Jiangtong Li, Quan Chen, Pu Pang, Zijun Li, Zhenhua Han, Yuqing Yang, Minyi Guo', 'Conference': 'HPCA2026 (CCF-A)'}, {'Paper': 'Towards Resource-Efficient Serverless LLM Inference with SLINFER', 'Authors': 'Chuhao Xu, Zijun Li, Quan Chen, Han Zhao, Xueyan Tang, Minyi Guo', 'Conference': 'HPCA2026 (CCF-A)'}, {'Paper': 'Efficient Performance-Aware GPU Sharing with Compatibility and Isolation through Kernel Space Interception', 'Authors': 'Shulai Zhang, Ao Xu, Quan Chen, Han Zhao, Weihao Cui, Zhen Wang, Yan Li, Limin Xiao, Minyi Guo', 'Conference': 'ATC2025 (CCF-A)'}, {'Paper': 'EDAS: Enabling Fast Data Loading for GPU Serverless Computing', 'Authors': 'Han Zhao, Weihao Cui, Quan Chen, Zijun Li, Zhenhua Han, Nan Wang, Yu Feng, Jieru Zhao, Chen Chen, Jingwen Leng, Minyi Guo', 'Journal': 'TACO2025 (CCF-A)'}, {'Paper': 'Taming Flexible Job Packing in Deep Learning Training Clusters', 'Authors': 'Pengyu Yang, Weihao Cui, Chunyu Xue, Han Zhao, Chen Chen, Quan Chen, Jing Yang, Minyi Guo', 'Journal': 'TACO2025 (CCF-A)'}, {'Paper': 'ARACHNE: Optimizing distributed parallel applications with reduced inter-process communication', 'Authors': 'Yifu He, Han Zhao, Quan Chen, Weihao Cui, Minyi Guo', 'Journal': 'TACO2025 (CCF-A)'}, {'Paper': 'Improving GPU Sharing Performance through Adaptive Bubbleless Spatial-Temporal Sharing', 'Authors': 'Shulai Zhang, Quan Chen, Weihao Cui, Han Zhao, Chunyu Xue, Zhen Zheng, Wei Lin, Minyi Guo', 'Conference': 'Eurosys2025 (CCF-A)'}, {'Paper': 'Potamoi: Accelerating neural rendering via a unified streaming architecture', 'Authors': 'Yu Feng, Weikai Lin, Zihan Liu, Jingwen Leng, Minyi Guo, Han Zhao, Xiaofeng Hou, Jieru Zhao, Yuhao Zhu', 'Journal': 'TACO2024 (CCF-A)'}, {'Paper': 'Adaptive Kernel Fusion for Improving the GPU Utilization', 'Authors': 'Han Zhao^, Junxiao Deng^, Weihao Cui, Quan Chen, Youtao Zhang, Deze Zeng, Minyi Guo', 'Status': 'Incomplete'}]
Research Experience
Currently an assistant professor at the Department of Computer Science and Engineering, Shanghai Jiao Tong University, working closely with Prof. Quan Chen and Assist Prof. Weihao Cui.
Background
Research Interests: Task scheduling, resource management in datacenters, DNN inference system design, cloud computing and deep learning systems, LLM inference and training systems, serverless architectures, etc. Professional Field: Computer Science and Engineering.
Miscellany
Personal Interests: Looking for prospective undergraduate and master's students (Enrollment Date: 2026.09) who are interested in the above research areas.