Scholar
Mukul Gagrani
Google Scholar ID: ERZXPy4AAAAJ
Qualcomm AI Research
Efficient LLM
Reinforcement Learning
Combinatorial Optimization
Stochastic Control
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
403
H-index
10
i10-index
10
Publications
20
Co-authors
12
list available
Contact
GitHub
Open ↗
LinkedIn
Open ↗
Publications
5 items
Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing
2026
Cited
0
ConFu: Contemplate the Future for Better Speculative Sampling
2026
Cited
0
Fast Forward: Accelerating LLM Prefill with Predictive FFN Sparsity
2026
Cited
0
VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs
2025
Cited
0
CAOTE: KV Caching through Attention Output Error based Token Eviction
2025
Cited
0
Resume (English only)
Co-authors
12 total
Ashutosh Nayyar
University of Southern California
Yi Ouyang
Preferred Networks
Co-author 3
Rahul Jain
Professor of ECE and CS, USC and Research Scientist, Google DeepMind
Co-author 5
Harris Teague
Qualcomm, Inc.
Aditya Mahajan
McGill University
Marcos M. Vasconcelos
Assistant Professor, Florida State University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up