Scholar
Shiqing Fan
Google Scholar ID: 2DfQpHAAAAAJ
NVIDIA
Deep Learning
Distributed Systems
E2E Optimization
Citations & Impact (all-time)
Citations: 374
H-index: 5
i10-index: 2
Publications: 8
Co-authors: 4 (list available)
Publications
4 items
Scalable Training of Mixture-of-Experts Models with Megatron Core
2026 · Cited: 0
MCPToolBench++: A Large Scale AI Agent Model Context Protocol MCP Tool Use Benchmark
2025 · Cited: 0
MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
2025 · Cited: 0
Upcycling Large Language Models into Mixture of Experts
arXiv.org · 2024 · Cited: 12
Co-authors
4 total
Chuan Wu
Professor of Computer Science, The University of Hong Kong