Yuhang Liu (刘宇航)
Scholar

Yuhang Liu (刘宇航)

Google Scholar ID: RuBpm20AAAAJ
Zhejiang University
GUI Agents(Multimodal) Large Language Models
Citations & Impact
All-time
Citations
88
 
H-index
4
 
i10-index
2
 
Publications
7
 
Co-authors
0
 
Publications
7 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • 2025: 'InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization', Under review.
  • 2025: 'InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners', Under review.
  • 2025: 'InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection', WCUA @ ICML 2025.
  • 2025: 'Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models', Under review.
  • 2025: 'InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning', Under review.
  • 2024: 'Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation', CEFSW @ ICMR 2025.
Research Experience
  • Hands-on experience in developing advanced agents like InfiGUI-R1 and InfiGUIAgent, focusing on bridging the gap between reactive systems and deliberative reasoners in complex, interactive environments.
Education
  • M.S. Student at Zhejiang University, advised by Prof. Shengyu Zhang; Technical Staff Intern at InfiX.ai, advised by Prof. Hongxia Yang.
Background
  • Research interests include Large Language Models (LLMs), Multimodal GUI Agents, and Reasoning Enhancement. A Master's student focusing on enhancing the capabilities of AI systems.
Co-authors
0 total
Co-authors: 0 (list not available)