Scholar
Xiuhong Li
Google Scholar ID: 90eREm0AAAAJ
Infinigence-AI
Deep Learning System
GPGPU
Deep Learning Compiler
Citations & Impact (all-time)
Citations: 1,106
H-index: 16
i10-index: 24
Publications: 20
Co-authors: 12 (list available)
Contact
No contact links provided.
Publications
9 items
NanoCP: Request-Level Dynamic Context Parallelism for Data-Expert Parallel Decoding (2026). Cited: 0
TASP: Topology-aware Sequence Parallelism (2025). Cited: 0
Zeppelin: Balancing Variable-length Workloads in Data Parallel Large Model Training (2025). Cited: 0
Past-Future Scheduler for LLM Serving under SLA Guarantees (2025). Cited: 0
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design (2025). Cited: 0
FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation (2025). Cited: 0
semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage (2025). Cited: 0
MBQ: Modality-Balanced Quantization for Large Vision-Language Models (2024). Cited: 0
Co-authors
12 total
Yun (Eric) Liang
Professor of EECS, Peking University, ACM Distinguished Scientist
Ke Hong
Tsinghua University
Guohao Dai
Associate Professor of Shanghai Jiao Tong University
Yu Wang (汪玉)
Department of Electronic Engineering, Tsinghua University, China
Dahua Lin
The Chinese University of Hong Kong
Jiangfei Duan
The Chinese University of Hong Kong
Xingcheng ZHANG
Shanghai AI lab
Size Zheng
ByteDance Seed