Paper 'The Overthinker’s DIET: Cutting Token Calories with DIfficulty-Aware Training' accepted at NeurIPS 2025; Blog post 'From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones' released; 'Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System' included in ALC 2025 Findings; 'Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence' spotlighted at ICLR 2025; 'AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors' accepted to ICLR 2024.
Research Experience
Member of THUNLP group; Research involves designing and implementing agent systems for LLMs, exploring agent communication and cooperation, and investigating the mechanism of LLM RL.
Education
Bachelor's degree from the Department of Computer Science and Technology at Tsinghua University; Currently pursuing a PhD at Tsinghua University, advised by Prof. Zhiyuan Liu.
Background
Currently a 4th-year PhD student at Tsinghua University, focusing on natural language processing (NLP) and machine learning (ML), with a particular emphasis on improving the performance and efficiency of agent systems and large language model (LLM) systems. Specific research interests include (Multi-)Agent Systems and Reinforcement Learning.