Conference on Empirical Methods in Natural Language Processing · 2024
Cited
3
Resume (English only)
Academic Achievements
Paper 'LeMix: Unified Scheduling for LLM Training and Inference on Multi-GPU Systems' received the Outstanding Paper Award at RTSS 2025.
Paper 'Contextual Bandits with Large Action Spaces: Made Practical' was incorporated into the leading machine learning library Vowpal Wabbit.
Paper 'Contextual Bandits with Smooth Regret: Computational Efficiency in Continuous Action Spaces' was selected as a full oral presentation at ICML 2022 (top 2.1%).
Published multiple papers at top-tier conferences including NeurIPS, ICML, AISTATS, and EMNLP, covering topics such as active learning, contextual bandits, pure exploration linear bandits, kernel/neural bandits, and sequential decision-making with LLMs.
Several recent works (2025) are preprints under review, addressing test-time matching, online finetuning of Decision Transformers, strategic scaling of test-time compute, and multimodal active learning.