Scholar

Wenzhe Li

Google Scholar ID: WMC5PsAAAAAJ

Princeton University

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

497

H-index

i10-index

Publications

Co-authors

list available

Contact

TwitterOpen ↗GitHubOpen ↗

Publications

10 items

Continual Harness: Online Adaptation for Self-Improving Foundation Agents

2026

Cited

Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning

2026

Cited

SIQA: Toward Reliable Scientific Image Quality Assessment

2026

Cited

Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

2025

Cited

Q-Mirror: Unlocking the Multi-Modal Potential of Scientific Text-Only QA Pairs

2025

Cited

The Ever-Evolving Science Exam

2025

Cited

LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra

2025

Cited

Affordance Benchmark for MLLMs

2025

Cited

Resume (English only)

Academic Achievements

Published papers at ICML 2024, ICLR AGI Workshop 2024 (FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning); AAAI 2023 (Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery); NeurIPS 2022 (Latent-Variable Advantage-Weighted Policy Optimization for Offline RL), etc. Ranked 1st on AlpacaEval 2.0 Leaderboard (non-adversarial).

Research Experience

Teaching Assistant for Deep Reinforcement Learning (Tsinghua University, Mar 2022); Teaching Assistant for Artificial Intelligence: Principles and Techniques (Tsinghua University, Sep 2021). Participated in multiple research projects, including the design of a benchmark platform for competitive multi-agent reinforcement learning called FightLadder.

Education

Ph.D. in Electrical and Computer Engineering, Princeton University, 2023 -- Present, Advisor: Chi Jin; B.E. in Computer Science and Technology, Tsinghua University, 2017 -- 2021, Advisors: Chongjie Zhang, Guy Van den Broeck, Stefano Ermon

Background

Research interests include LLM reasoning & agents, reinforcement learning, and multi-agent learning. Has worked on topics such as large language models, multi-agent reinforcement learning, offline reinforcement learning, score-based generative models, and approximate inference.

Miscellany