Scholar

Yuhang Liu (刘宇航)

Google Scholar ID: RuBpm20AAAAJ

Zhejiang University

GUI Agents(Multimodal) Large Language Models

Citations & Impact

All-time

Citations

H-index

i10-index

Publications

Co-authors

Contact

Publications

7 items

Browse publications on Google Scholar (top-right) ↗

Resume (English only)

Academic Achievements

2025: 'InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization', Under review.
2025: 'InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners', Under review.
2025: 'InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection', WCUA @ ICML 2025.
2025: 'Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models', Under review.
2025: 'InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning', Under review.
2024: 'Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation', CEFSW @ ICMR 2025.

Research Experience

Hands-on experience in developing advanced agents like InfiGUI-R1 and InfiGUIAgent, focusing on bridging the gap between reactive systems and deliberative reasoners in complex, interactive environments.

Education

M.S. Student at Zhejiang University, advised by Prof. Shengyu Zhang; Technical Staff Intern at InfiX.ai, advised by Prof. Hongxia Yang.

Background

Research interests include Large Language Models (LLMs), Multimodal GUI Agents, and Reasoning Enhancement. A Master's student focusing on enhancing the capabilities of AI systems.

Co-authors

0 total

Co-authors: 0 (list not available)