Yue Fan
Scholar

Yue Fan

Google Scholar ID: 1NfBa5sAAAAJ
Ph.D candidate, University of California, Santa Cruz
Computer VisionNLPAgent
Citations & Impact
All-time
Citations
455
 
H-index
9
 
i10-index
9
 
Publications
17
 
Co-authors
15
list available
Resume (English only)
Academic Achievements
  • GRIT: Teaching MLLMs to Think with Images – NeurIPS 2025 (first author)
  • GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration – EMNLP 2025 (first author)
  • MMIR paper accepted as a Findings paper at ACL 2025
  • LLM-Coordination paper accepted at NAACL 2025
  • Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding – EMNLP 2024 (first author)
  • Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA – ACL 2024 (first author)
  • R2H: Building Multimodal Navigation Helpers that Respond to Help Requests – EMNLP 2023 (first author)
  • Athena 3.0: Personalized Multimodal ChatBot with Neuro-Symbolic Dialogue Generators – Alexa Prize SocialBot Grand Challenge 5 (first author)
  • Aerial Vision-and-Dialog Navigation – ACL 2023 (first author)
  • JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents – Preprint 2022 (co-first author)
  • Learn by Observation: Imitation Learning for Drone Patrolling from Videos of A Human Navigator – IROS 2020 (first author)
  • Passed Ph.D. qualification exam and became a Ph.D. candidate in April 2024