Scholar
Yuhao Dong
Google Scholar ID: kMui170AAAAJ
Tsinghua University, Nanyang Technological University
Multi-modal Learning
Computer Vision
Citations & Impact (all-time)
Citations: 1,092
H-index: 14
i10-index: 14
Publications: 20
Co-authors: 14 (list available)
Publications
22 items
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding (2026). Cited: 0
FileGram: Grounding Agent Personalization in File-System Behavioral Traces (2026). Cited: 0
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning (2026). Cited: 0
Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models (2026). Cited: 0
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining (2026). Cited: 0
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition (2026). Cited: 0
The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms (2026). Cited: 0
3EED: Ground Everything Everywhere in 3D (2025). Cited: 0
Co-authors
14 total
Ziwei Liu
Associate Professor, Nanyang Technological University
Yongming Rao
Tencent Hunyuan
Jingkang Yang
PhD, MMLab@NTU
Zuyan Liu
Tsinghua University
Yuanhan Zhang
PhD Candidate, MMLab@NTU
Brian (Bo) Li
PhD Student@NTU, Singapore
Ranjay Krishna
University of Washington, Allen Institute for AI
Zhaoxi Chen
Ph.D. Student, Nanyang Technological University