Scholar
Bita Darvish Rouhani
Google Scholar ID: oqeh4cYAAAAJ
Distinguished Engineer, NVIDIA
Generative AI
AI Supercomputing
Systems for AI
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
3,655
H-index
28
i10-index
42
Publications
20
Co-authors
9
list available
Contact
No contact links provided.
Publications
8 items
Guess-Verify-Refine: Data-Aware Top-K for Sparse-Attention Decoding on Blackwell via Temporal Correlation
2026
Cited
0
SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding
2026
Cited
1
Efficient MoE Serving in the Memory-Bound Regime: Balance Activated Experts, Not Tokens
2025
Cited
0
Pretraining Large Language Models with NVFP4
2025
Cited
0
Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding
2025
Cited
0
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
2025
Cited
0
Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques
2025
Cited
0
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration
2025
Cited
0
Resume (English only)
Co-authors
9 total
Farinaz Koushanfar
Professor and Siavouche Nemat-Nasser Endowed Chair of ECE, UC San Diego
Eric S Chung
VP of AI Computing, NVIDIA
Co-author 3
Azalia Mirhoseini
Assistant Professor of Computer Science, Stanford - Google DeepMind
Steven K. Reinhardt
AMD
Maxim Naumov
Meta (Director of Engineering & Research)
Co-author 7
Pradeep Dubey
Intel Corporation
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up