Scholar
Gabriele Oliaro
Google Scholar ID: 6-evBPAAAAAJ
Carnegie Mellon University, Snowflake AI Research
Machine Learning
Distributed Systems
Parallel Computing
Networking
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
599
H-index
9
i10-index
9
Publications
10
Co-authors
14
list available
Contact
No contact links provided.
Publications
8 items
FastKernels: Benchmarking GPU Kernel Generation in Production
2026
Cited
0
Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel
2026
Cited
0
OWL: Overcoming Window Length-Dependence in Speculative Decoding for Long-Context Inputs
2025
Cited
0
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
2025
Cited
0
AdaServe: SLO-Customized LLM Serving with Fine-Grained Speculative Decoding
2025
Cited
0
SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference
arXiv.org · 2024
Cited
4
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
arXiv.org · 2024
Cited
8
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
arXiv.org · 2023
Cited
87
Resume (English only)
Co-authors
14 total
Zhihao Jia
Assistant Professor of Computer Science, Carnegie Mellon University
Xupeng Miao
Purdue University
April Yang
NVIDIA
Minlan Yu
Harvard University
Zikun Li
Carnegie Mellon University
Shuhuai Lin
Carnegie Mellon University
Zhuofu Chen
Princeton University
Aurick Qiao
Snowflake AI Research
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up