Scholar
Yuhang Cao
Google Scholar ID: sJkqsqkAAAAJ
MMLab, The Chinese University of Hong Kong
Multi-Modal Large Language Model
Object Detection
Few Shot Object Detection
Citations & Impact
All-time
Citations: 7,058
H-index: 19
i10-index: 27
Publications: 20
Co-authors: 0
Contact
No contact links provided.
Publications
33 items
Visual Self-Refine: A Pixel-Guided Paradigm for Accurate Chart Parsing
2026
Cited
0
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition
2026
Cited
0
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
2025
Cited
0
Think Visually, Reason Textually: Vision-Language Synergy in ARC
2025
Cited
0
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
2025
Cited
0
STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence
2025
Cited
0
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation
2025
Cited
0
OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction
2025
Cited
0