Scholar

Weize Chen

Google Scholar ID: 0CoGHtIAAAAJ

Tsinghua University

NLPML

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

4,863

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailchenweize1998@gmail.com TwitterOpen ↗GitHubOpen ↗

Publications

18 items

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

2026

Cited

CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning

2026

Cited

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

2025

Cited

From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

2025

Cited

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

2025

Cited

Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration

2025

Cited

Co-Saving: Resource Aware Multi-Agent Collaboration for Software Development

2025

Cited

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

2025

Cited

Resume (English only)

Academic Achievements

Paper 'The Overthinker’s DIET: Cutting Token Calories with DIfficulty-Aware Training' accepted at NeurIPS 2025; Blog post 'From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones' released; 'Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System' included in ALC 2025 Findings; 'Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence' spotlighted at ICLR 2025; 'AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors' accepted to ICLR 2024.

Research Experience

Member of THUNLP group; Research involves designing and implementing agent systems for LLMs, exploring agent communication and cooperation, and investigating the mechanism of LLM RL.

Education

Bachelor's degree from the Department of Computer Science and Technology at Tsinghua University; Currently pursuing a PhD at Tsinghua University, advised by Prof. Zhiyuan Liu.

Background

Currently a 4th-year PhD student at Tsinghua University, focusing on natural language processing (NLP) and machine learning (ML), with a particular emphasis on improving the performance and efficiency of agent systems and large language model (LLM) systems. Specific research interests include (Multi-)Agent Systems and Reinforcement Learning.

Co-authors

9 total