Scholar

Kai Zheng

Google Scholar ID: dxjqLjsAAAAJ

Tencent Hunyuan X

Machine LearningNatural Language Processing

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,441

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailzhengkai@microsoft.com GitHubOpen ↗

Publications

6 items

TurnOPD: Making On-Policy Distillation Turn-Aware for Efficient Long-Horizon Agent Training

2026

Cited

VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct

2026

Cited

EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

2026

Cited

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

2026

Cited

RubricBench: Aligning Model-Generated Rubrics with Human Standards

2026

Cited

OffSeeker: Online Reinforcement Learning Is Not All You Need for Deep Research Agents

2026

Cited

Resume (English only)

Academic Achievements

Released WizardLM-2, outperforming GPT-4 on MT-Bench, GPT4-Turbo on AlpacaEval 2.0, and Claude 3 Sonnet on Arena-Hard; WizardLM achieved the 1st rank on the Stanford AlpacaEval leaderboard; Published multiple papers in top-tier conferences such as ICLR 2024, ACL 2023, EMNLP 2022, etc.

Research Experience

Worked at Baidu's ERNIE team (responsible for GLUE@Top1), Baidu's LTR team (core search ranking), and Kuaishou's recommendation ranking modeling team, deploying models on products.

Education

Received a master’s degree from the Institute of Computational Linguistics at Peking University, under the supervision of Houfeng Wang.

Background

Research interests include large language models, reinforcement learning, multi-modal LLMs, dialogue systems, and information retrieval. Currently a research scientist at Microsoft AI, contributing core deep models for Microsoft XiaoIce, Bing Search Ranking, and Microsoft Copilot.

Miscellany

Personal projects include WizardLM, WizardCoder, WizardMath, and Evol-Instruct; Looking for highly self-motivated students to work as research interns.

Co-authors

8 total