Released WizardLM-2, outperforming GPT-4 on MT-Bench, GPT4-Turbo on AlpacaEval 2.0, and Claude 3 Sonnet on Arena-Hard; WizardLM achieved the 1st rank on the Stanford AlpacaEval leaderboard; Published multiple papers in top-tier conferences such as ICLR 2024, ACL 2023, EMNLP 2022, etc.
Research Experience
Worked at Baidu's ERNIE team (responsible for GLUE@Top1), Baidu's LTR team (core search ranking), and Kuaishou's recommendation ranking modeling team, deploying models on products.
Education
Received a master’s degree from the Institute of Computational Linguistics at Peking University, under the supervision of Houfeng Wang.
Background
Research interests include large language models, reinforcement learning, multi-modal LLMs, dialogue systems, and information retrieval. Currently a research scientist at Microsoft AI, contributing core deep models for Microsoft XiaoIce, Bing Search Ranking, and Microsoft Copilot.
Miscellany
Personal projects include WizardLM, WizardCoder, WizardMath, and Evol-Instruct; Looking for highly self-motivated students to work as research interns.