Huiqiang Jiang
Google Scholar ID: 99KtvpYAAAAJ
Microsoft Research Asia
Efficient AI
LLMs
MLSys
Links: Homepage, Google Scholar
Citations & Impact (all-time)
Citations: 1,586
H-index: 15
i10-index: 19
Publications: 20
Co-authors: 11
Contact
GitHub
Publications (20)
VecAttention: Vector-wise Sparse Attention for Accelerating Long Context Inference (2026, cited 0)
SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling (2026, cited 0)
ParisKV: Fast and Drift-Robust KV-Cache Retrieval for Long-Context LLMs (2026, cited 0)
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices (2025, cited 0)
MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training (2025, cited 0)
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning (2025, cited 0)
Scaling LLM Test-Time Compute with Mobile NPU on Smartphones (2025, cited 0)
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding (2025, cited 0)
Resume (English only)
Miscellany
Self-described as a "fake MLSys/NLPer" and "unpopular blogger"
Active on personal blog and Zhihu
Programming enthusiast, GitHub: @iofu728
Contact: iofu728@gmail.com, Phone: +86 178 xxxx xxxx
Actively seeking research interns interested in efficient LLM methods
Co-authors (11 total)
Lili Qiu
NAI Fellow, ACM Fellow, IEEE Fellow; Professor, Dept. of Computer Science, The University of Texas
Yuqing Yang
Microsoft
Qianhui Wu (武千惠)
Microsoft Research
Chin-Yew Lin
Principal Research Manager of Knowledge Computing Group, Microsoft Research Asia
Yucheng Li
University of Surrey
Zhenhua Han
Microsoft Research Asia
Börje F. Karlsson
Beijing Academy of Artificial Intelligence (BAAI)
Baotong Lu
Microsoft Research