Kuntai Du
Google Scholar ID: cY5PxOQAAAAJ
University of Chicago
Large Language Models
Video analytics
Links: Homepage · Google Scholar
Citations & Impact (all-time)
Citations: 550
H-index: 8
i10-index: 7
Publications: 20
Co-authors: 0
Contact
No contact links provided.
Publications
9 items
EVICPRESS: Joint KV-Cache Compression and Eviction for Efficient LLM Serving · 2025 · Cited 0
LMCache: An Efficient KV Cache Layer for Enterprise-Scale LLM Inference · 2025 · Cited 0
AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving · 2025 · Cited 0
PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications · 2025 · Cited 0
Jenga: Effective Memory Management for Serving LLM with Heterogeneity · 2025 · Cited 0
Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache · 2025 · Cited 0
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation · arXiv.org · 2024 · Cited 7
DroidSpeak: KV Cache Sharing for Cross-LLM Communication and Multi-LLM Serving · 2024 · Cited 3
Resume (English only)
Co-authors
0 total (list not available)