Gabriele Oliaro

Google Scholar ID: 6-evBPAAAAAJ
Carnegie Mellon University, Snowflake AI Research
Machine Learning, Distributed Systems, Parallel Computing, Networking
Citations & Impact (all-time)
Citations: 599
H-index: 9
i10-index: 9
Publications: 10
Co-authors: 14 (list available)
Publications (8 items)
FastKernels: Benchmarking GPU Kernel Generation in Production (2026). Cited by: 0
Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel (2026). Cited by: 0
OWL: Overcoming Window Length-Dependence in Speculative Decoding for Long-Context Inputs (2025). Cited by: 0
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning (2025). Cited by: 0
AdaServe: SLO-Customized LLM Serving with Fine-Grained Speculative Decoding (2025). Cited by: 0
SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference (arXiv.org, 2024). Cited by: 4
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning (arXiv.org, 2024). Cited by: 8
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems (arXiv.org, 2023). Cited by: 87
Co-authors (14 total)
Zhihao Jia, Assistant Professor of Computer Science, Carnegie Mellon University
Xupeng Miao, Purdue University
April Yang, NVIDIA
Minlan Yu, Harvard University
Zikun Li, Carnegie Mellon University
Shuhuai Lin, Carnegie Mellon University
Zhuofu Chen, Princeton University
Aurick Qiao, Snowflake AI Research
