Scholar
Bita Darvish Rouhani
Google Scholar ID: oqeh4cYAAAAJ
Distinguished Engineer, NVIDIA
Generative AI
AI Supercomputing
Systems for AI
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
3,222
H-index
26
i10-index
36
Publications
20
Co-authors
9
list available
Contact
No contact links provided.
Publications
6 items
Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens
2025
Cited
0
Pretraining Large Language Models with NVFP4
2025
Cited
0
Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding
2025
Cited
0
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
2025
Cited
0
Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques
2025
Cited
0
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration
2025
Cited
0
Resume (English only)
Co-authors
9 total
Farinaz Koushanfar
Professor and Siavouche Nemat-Nasser Endowed Chair of ECE, UC San Diego
Eric S Chung
VP of AI Computing, NVIDIA
Co-author 3
Azalia Mirhoseini
Assistant Professor of Computer Science, Stanford - Google DeepMind
Steven K. Reinhardt
AMD
Maxim Naumov
Meta (Director of Engineering & Research)
Co-author 7
Pradeep Dubey
Intel Corporation
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up