Scholar
Weilin Cai
Google Scholar ID: dacV5lQAAAAJ
The Hong Kong University of Science and Technology (Guangzhou)
Machine Learning Systems
High Performance Computing
Artificial Intelligence
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
154
H-index
4
i10-index
3
Publications
13
Co-authors
5
list available
Contact
No contact links provided.
Publications
6 items
Accelerating Mixture-of-Experts Inference by Hiding Offloading Latency with Speculative Decoding
2025
Cited
0
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction
2025
Cited
0
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
2025
Cited
0
MoC-System: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2 · 2024
Cited
1
A Survey on Mixture of Experts in Large Language Models
IEEE Transactions on Knowledge and Data Engineering · 2024
Cited
77
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
arXiv.org · 2024
Cited
8
Resume (English only)
Co-authors
5 total
Jiayi Huang
The Hong Kong University of Science and Technology (Guangzhou)
Co-author 2
Juyong Jiang
PhD Candidate, The Hong Kong University of Science and Technology
Le Qin
The Hong Kong University of Science and Technology (Guangzhou)
Shwai He
University of Maryland, College Park
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up