Woosuk Kwon
Google Scholar ID: _AT3eUcAAAAJ
PhD student, UC Berkeley
Machine Learning
Systems
Citations & Impact (all-time)
Citations: 5,923
H-index: 9
i10-index: 9
Publications: 15
Co-authors: 0
Contact
CV
Twitter
GitHub
LinkedIn
Publications
4 items
Gemma 3 Technical Report (2025). Cited: 0
Jenga: Effective Memory Management for Serving LLM with Heterogeneity (2025). Cited: 0
APEX: An Extensible and Dynamism-Aware Simulator for Automated Parallel Execution in LLM Serving (2024). Cited: 0
Optimizing Speculative Decoding for Serving Large Language Models Using Goodput (arXiv.org, 2024). Cited: 23
Resume (English only)
Academic Achievements
Gemma 2: Improving Open Language Models at a Practical Size (arXiv 2024)
Optimizing Speculative Decoding for Serving Large Language Models Using Goodput (arXiv 2024)
Efficient Memory Management for Large Language Model Serving with PagedAttention (SOSP 2023)
SkyPilot: An Intercloud Broker for Sky Computing (NSDI 2023)
A Fast Post-Training Pruning Framework for Transformers (NeurIPS 2022)
Learned Token Pruning for Transformers (KDD 2022)
Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning (NeurIPS 2020 Spotlight)
Graphene: Strong yet Lightweight Row Hammer Protection (MICRO 2020, IEEE Micro Top Picks Honorable Mention)
Co-authors
0 total (list not available)