Selected Publications: Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities; Heterogeneous Swarms: Jointly Optimizing Model Roles and Weights for Multi-LLM Systems; Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence; Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation.
Research Experience
Aug 2023 - Present, Google, Research Scientist at Cloud AI Research; Sep 2018 - July 2023, Northeastern University, Research Assistant at SPIRAL Group; June 2021 - Jan 2023, Google, Student Researcher / Research Intern at Cloud AI Research; Feb 2017 - July 2018, Tsinghua University, Research Assistant at i-Vision Group; July 2017 - Sep 2017, University of Michigan, Visiting Researcher at Vision & Learning Lab.
Education
PhD in Machine Learning from Northeastern University's SPIRAL Group, advised by Prof. Jennifer G. Dy, with close collaboration with Prof. Stratis Ioannidis and Prof. Yanzhi Wang. BS in Electronic Engineering from Tsinghua University, where he worked on computer vision with Prof. Jiwen Lu (Tsinghua) and Prof. Jia Deng (Princeton), and big data with Prof. Yong Li (Tsinghua).
Background
Research interests include large language models (LLMs) and their applications, specifically model adaptation, multi-LLM collaboration, and multi-agent systems.
Miscellany
Looking for self-motivated student researchers / research interns with interests and expertise in LLMs.