Shihao Wang
Scholar

Shihao Wang

Google Scholar ID: 7TWugs4AAAAJ
Hong Kong Polytechnic University
deep learningautonomous drivingvision language models
Citations & Impact
All-time
Citations
816
 
H-index
8
 
i10-index
7
 
Publications
13
 
Co-authors
11
list available
Resume (English only)
Academic Achievements
  • - Publications:
  • - VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding (arXiv 2025)
  • - GR00T N1.5 An Improved Open Foundation Model for Generalist Humanoid Robots (Blog 2025)
  • - Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training (ICCV 2025)
  • - Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models (NIPS 2025)
  • - Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models (arXiv)
  • - Awards:
  • - First Place in End-to-End Driving at Scale, Second Place in Driving with Language, CVPR 2024 Autonomous Driving Grand Challenge
  • - Projects:
  • - StreamPETR (ICCV’23), a streaming paradigm for camera-based 3D perception that reached #1 among online methods on nuScenes and has been widely adopted in both academia and industry
Research Experience
  • - Work Experience:
  • - Joined NVIDIA AV Applied Research Group as a Research Intern in October 2023
  • - Joined MEGVII Technology Foundation Model Group as a Research Intern in October 2022
  • - Research Projects:
  • - Eagle-VLM series, powering NVIDIA's commercial multimodal models
  • - Isaac GR00T humanoid robotics platform
  • - OmniDrive and Hydra-MDP, connecting 3D perception with multimodal reasoning for end-to-end autonomous driving
Education
  • - Degree: Ph.D. Student
  • - School: The Hong Kong Polytechnic University
  • - Advisor: Prof. Lei Zhang
  • - Time: Currently enrolled
  • - Major: Computing
Background
  • - Research Interests: 3D perception and planning, multimodal foundation models, streaming video understanding, test-time adaptation, etc.
  • - Professional Field: Department of Computing, particularly in autonomous driving and robotics
  • - Introduction: A second-year Ph.D. student in the Department of Computing at The Hong Kong Polytechnic University, advised by Prof. Lei Zhang. Closely collaborates with NVIDIA Research.