Zhiding Yu
Scholar

Zhiding Yu

Google Scholar ID: 1VI_oYUAAAAJ
Principal Research Scientist & Research Lead, NVIDIA Research
Computer VsionDeep Learning
Citations & Impact
All-time
Citations
24,839
 
H-index
56
 
i10-index
89
 
Publications
20
 
Co-authors
15
list available
Resume (English only)
Academic Achievements
  • Winner, CVPR24 Challenge on End-to-End Driving at Scale (Hydra-MDP).
  • 2nd Place, CVPR24 Challenge on Driving with Language.
  • Winner, CVPR23 Challenge on 3D Occupancy Prediction (FB-BEV/FB-OCC).
  • Winner, ECCV22 Robust Vision Challenge (RVC) on Semantic Segmentation.
  • Winner, CVPR18 Autonomous Driving Challenge (WAD) on Domain Adaptation.
  • 2nd Place, ICMI15 EmotiW Challenge on Static Facial Expression Recognition.
  • Best Paper Award, BMVC 2020.
  • Best Paper Award, WACV 2015.
  • Best Student Paper Award, ISCSLP 2014.
  • Most Influential NeurIPS Paper Award (SegFormer).
  • Numerous publications listed on Google Scholar.
Background
  • Principal Research Scientist & Research Lead at the Learning & Perception Research Group, NVIDIA Research.
  • Interested in building general autonomy and intelligence across virtual and physical domains.
  • Recent focus includes Vision Transformers, LLMs, multimodal LLMs, and vision-language-action (VLA) models.
  • Applications span open-world understanding, reasoning, AV/robot perception-planning, and agentic systems.
  • Works are characterized by state-of-the-art performance, scalable architectures, and data-centric strategies for real-world generalization.