Scholar
Zhibo Yang
Google Scholar ID: X3K4jQwAAAAJ
Alibaba Group; Tsinghua University
OCR
MLLMs
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
3,377
H-index
17
i10-index
25
Publications
20
Co-authors
4
list available
Contact
No contact links provided.
Publications
11 items
MPDocBench-Parse: Benchmarking Practical Multi-page Document Parsing
2026
Cited
0
Multi-domain Multi-modal Document Classification Benchmark with a Multi-level Taxonomy
2026
Cited
0
Qwen3-VL-Seg: Unlocking Open-World Referring Segmentation with Vision-Language Grounding
2026
Cited
0
CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing
2026
Cited
0
Triviality Corrected Endogenous Reward
2026
Cited
0
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos
2026
Cited
0
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
2026
Cited
0
From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning
2026
Cited
0
Load more
Resume (English only)
Co-authors
4 total
Xiang Bai
Huazhong University of Science and Technology (HUST)
Cong Yao
Alibaba DAMO Academy
Junyang Lin
Qwen Team, Alibaba Group & Peking University
Lianwen Jin
Professor of Electronic and Information Engineering, South China University of Technology
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up