Gengze Zhou
Scholar

Gengze Zhou

Google Scholar ID: Uu8bkGgAAAAJ
The University of Adelaide
Embodied AIMultimodality
Citations & Impact
All-time
Citations
534
 
H-index
4
 
i10-index
4
 
Publications
8
 
Co-authors
0
 
Resume (English only)
Academic Achievements
  • - Paper 'SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts' accepted to ICCV 2025
  • - Paper 'NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models' accepted to ECCV 2024
  • - Paper 'NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models' accepted to AAAI 2024
  • - Paper 'WebVLN: Vision-and-Language Navigation on Websites' accepted to AAAI 2024
  • - Paper 'NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation' accepted to RSS 2024
Research Experience
  • - Research Intern at Adobe Research, working on text-to-video generation
  • - Member of V3A Lab, University of Adelaide
  • - Teaching Assistant: COMP8536 - Deep Learning, ANU, 2022
  • - Master Student Supervisor: COMP7205 - Individual Research Project on Embodied MLLM Agents Evaluation, University of Adelaide, 2025
Education
  • - Ph.D. student at Australian Institute for Machine Learning (AIML), University of Adelaide, supervised by A/Prof. Qi Wu and Dr. Yicong Hong
  • - Master's student at Australian National University, supervised by Prof. Stephen Gould
  • - Bachelor's degree from Dalian University of Technology
Background
  • Research interests include creating explainable and embodied AI systems that can dynamically interact with both humans and their environments. The goal is to build an autonomous agent that can understand, reason, and navigate the physical world, while seamlessly communicating with humans in natural language. By integrating machine learning with visual and linguistic applications, he strives to enhance the transparency and interpretability of AI decision-making, fostering more natural and effective human-AI interactions.
Miscellany
  • Conference Reviewer: CVPR’(24, 25), MM’(24), EMNLP’(24, 25), AAAI’(25), ICRA’(25), ICLR’(25), NAACL’(25), ACL’(25), ICCV’(25), IROS’(25), NeurIPS’(25)
  • Journal Reviewer: TPAMI, TCSVT, RAL
Co-authors
0 total
Co-authors: 0 (list not available)