AgoraResearch hub
ExploreLibraryProfile
Account
Ritchie Zhao
Scholar

Ritchie Zhao

Google Scholar ID: 8dswaWgAAAAJ
NVIDIA
computer sciencecomputer architecture
Homepage↗Google Scholar↗
Citations & Impact
All-time
Citations
2,433
 
H-index
21
 
i10-index
26
 
Publications
20
 
Co-authors
16
list available
Contact
Emailrz252@cornell.eduCVOpen ↗GitHubOpen ↗LinkedInOpen ↗
Publications
7 items
LatentMoE: Toward Optimal Accuracy per FLOP and Parameter in Mixture of Experts
2026
Cited
1
Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens
2025
Cited
0
Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding
2025
Cited
0
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
2025
Cited
0
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration
2025
Cited
0
RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression
2025
Cited
0
Post-Training Quantization for 3D Medical Image Segmentation: A Practical Study on Real Inference Engines
2025
Cited
0
Resume (English only)
Co-authors
16 total
Zhiru Zhang
Zhiru Zhang
Cornell University
Eric S Chung
Eric S Chung
VP of AI Computing, NVIDIA
Bita Darvish Rouhani
Bita Darvish Rouhani
Distinguished Engineer, NVIDIA
Steve Dai
Steve Dai
NVIDIA Research
Co-author 5
Co-author 5
Co-author 6
Co-author 6
Jordan Dotzel
Jordan Dotzel
Cornell University
Christopher De Sa
Christopher De Sa
Associate Professor of Computer Science, Cornell University

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?