Scholar
Hanfei Yu
Google Scholar ID: _ECL3GIAAAAJ
Stevens Institute of Technology
Serverless Computing
Large-Scale AI Systems
Distributed ML Systems
LLM Systems
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
331
H-index
8
i10-index
8
Publications
15
Co-authors
9
list available
Contact
Email
hyu42@stevens.edu
CV
Open ↗
GitHub
Open ↗
Publications
5 items
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
2026
Cited
0
MoEless: Efficient MoE LLM Serving via Serverless Computing
2026
Cited
0
RLHFless: Serverless Computing for Efficient RLHF
2026
Cited
0
ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs
2025
Cited
0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving
2025
Cited
0
Resume (English only)
Academic Achievements
Recipient of the SoCC’24 Best Paper Award
SC’24 Best Student Paper Finalist
Selected as one of the 2025 ML and Systems Rising Stars
Publications in top-tier venues including SoCC’25, VLDB’25, SC’24, AAAI’24, SoCC’24, ASPLOS’24, TPDS’24, HPDC’23, WWW’22, ACSOS’21, EuroSys’26
Served on program committees or as reviewer for ICLR’26, ICPADS’25, SOSP’25, AAAI’26, EuroSys’26
Co-authors
9 total
Hao Wang
Assistant Professor, ECE at Stevens Institute of Technology
Jian Li
Assistant Professor, Stony Brook University
Seung-Jong Park
Computer Science, Missouri University of Science & Technology
Co-author 4
Xu Yuan
University of Delaware
Hong Zhang
University of Waterloo
Rohan Basu Roy
Assistant Professor, University of Utah
Yifan Sui
Shanghai Jiao Tong University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up