Scholar
Yizhou Shan
Google Scholar ID: qgxGqYAAAAAJ
Huawei Cloud
Disaggregation
Operating System
Distributed System
Computer Architecture
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,456
H-index
14
i10-index
15
Publications
20
Co-authors
0
Contact
No contact links provided.
Publications
7 items
ReviveMoE: Fast Recovery for Hardware Failures in Large-Scale MoE LLM Inference Deployments
2026
Cited
0
Efficient Serving of LLM Applications with Probabilistic Demand Modeling
2025
Cited
0
DDiT: Dynamic Resource Allocation for Diffusion Transformer Model Serving
2025
Cited
0
Efficient Long-Decoding Inference with Reasoning-Aware Attention Sparsity
2025
Cited
0
DeepFlow: Serverless Large Language Model Serving at Scale
2025
Cited
0
Fast and Live Model Auto Scaling with O(1) Host Caching
arXiv.org · 2024
Cited
4
EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models
arXiv.org · 2024
Cited
1
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up