Scholar
Zhibo Yang
Google Scholar ID: X3K4jQwAAAAJ
Alibaba Group; Tsinghua University
OCR
MLLMs
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
3,377
H-index
17
i10-index
25
Publications
20
Co-authors
4
list available
Contact
No contact links provided.
Publications
6 items
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
2026
Cited
0
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
2026
Cited
0
From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning
2026
Cited
0
UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents
2026
Cited
0
BabyVision: Visual Reasoning Beyond Language
arXiv.org · 2026
Cited
4
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking
arXiv.org · 2026
Cited
9
Resume (English only)
Co-authors
4 total
Xiang Bai
Huazhong University of Science and Technology (HUST)
Cong Yao
Alibaba DAMO Academy
Junyang Lin
Qwen Team, Alibaba Group & Peking University
Lianwen Jin
Professor of Electronic and Information Engineering, South China University of Technology
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up