Scholar
Ritchie Zhao
Google Scholar ID: 8dswaWgAAAAJ
NVIDIA
computer science
computer architecture
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
2,433
H-index
21
i10-index
26
Publications
20
Co-authors
16
list available
Contact
Email
rz252@cornell.edu
CV
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
7 items
LatentMoE: Toward Optimal Accuracy per FLOP and Parameter in Mixture of Experts
2026
Cited
1
Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens
2025
Cited
0
Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding
2025
Cited
0
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
2025
Cited
0
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration
2025
Cited
0
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression
2025
Cited
0
Post-Training Quantization for 3D Medical Image Segmentation: A Practical Study on Real Inference Engines
2025
Cited
0
Resume (English only)
Co-authors
16 total
Zhiru Zhang
Cornell University
Eric S Chung
VP of AI Computing, NVIDIA
Bita Darvish Rouhani
Distinguished Engineer, NVIDIA
Steve Dai
NVIDIA Research
Co-author 5
Co-author 6
Jordan Dotzel
Cornell University
Christopher De Sa
Associate Professor of Computer Science, Cornell University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up