Scholar
Zhefeng Wang
Google Scholar ID: t22ZUJ4AAAAJ
Huawei Cloud
NLP
AI system
LLM
multi-modality
Machine Learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
1,201
H-index
18
i10-index
30
Publications
20
Co-authors
11
list available
Contact
No contact links provided.
Publications
13 items
Discovering Decoupled Functional Modules in Large Language Models
2026
Cited
0
Cross-Resolution Distribution Matching for Diffusion Distillation
2026
Cited
0
$A^3$: Attention-Aware Accurate KV Cache Fusion for Fast Large Language Model Serving
2025
Cited
0
Adacc: Adaptive Compression and Activation Checkpointing for LLM Memory Management
2025
Cited
0
CaliDrop: KV Cache Compression with Calibration
2025
Cited
0
Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
2025
Cited
0
Accurate KV Cache Quantization with Outlier Tokens Tracing
2025
Cited
0
Taming the Titans: A Survey of Efficient LLM Inference Serving
2025
Cited
0
Load more
Resume (English only)
Co-authors
11 total
Baoxing Huai
HuaweiCloud
Enhong Chen
University of Science and Technology of China
Min Zhang
Professor of Computer Science, Soochow University
Co-author 4
Jian Pei
Arthur S. Pearse Distinguished Professor, Duke University
Tong Xu
Professor, University of Science and Technology of China
Yu Yang
Associate Professor, City University of Hong Kong
Xiaoye Qu
Shanghai AI Lab
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up