Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
2025: 'InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization', Under review.
2025: 'InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners', Under review.
2025: 'InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection', WCUA @ ICML 2025.
2025: 'Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models', Under review.
2025: 'InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning', Under review.
2024: 'Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation', CEFSW @ ICMR 2025.
Research Experience
Hands-on experience in developing advanced agents like InfiGUI-R1 and InfiGUIAgent, focusing on bridging the gap between reactive systems and deliberative reasoners in complex, interactive environments.
Education
M.S. Student at Zhejiang University, advised by Prof. Shengyu Zhang; Technical Staff Intern at InfiX.ai, advised by Prof. Hongxia Yang.
Background
Research interests include Large Language Models (LLMs), Multimodal GUI Agents, and Reasoning Enhancement. A Master's student focusing on enhancing the capabilities of AI systems.