Scholar
Ke Hong
Google Scholar ID: 134qQG4AAAAJ
Tsinghua University
efficient computing
GPU acceleration
sparse computing
ML system
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
537
H-index
9
i10-index
9
Publications
18
Co-authors
9
list available
Contact
CV
Open ↗
Publications
7 items
db-SP: Accelerating Sparse Attention for Visual Generative Models with Dual-Balanced Sequence Parallelism
2025
Cited
0
Reducing Latency of LLM Search Agent via Speculation-based Algorithm-System Co-Design
2025
Cited
0
TASP: Topology-aware Sequence Parallelism
2025
Cited
0
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models
2025
Cited
0
semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage
2025
Cited
0
FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation
2025
Cited
0
MBQ: Modality-Balanced Quantization for Large Vision-Language Models
2024
Cited
0
Resume (English only)
Co-authors
9 total
Guohao Dai
Associate Professor of Shanghai Jiao Tong University
Yu Wang (汪玉)
Department of Electronic Engineering, Tsinghua University, China
Xiuhong Li
Infinigence-AI
Jiaming Xu
Shanghai Jiao Tong University
Xuefei Ning
Tsinghua University
Shiyao Li (李师尧)
Ph.D student, Tsinghua University
Tianyu Fu
Ph.D at Tsinghua University
Xinhao Yang
Tsinghua University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up