Scholar
Cong Yao
Google Scholar ID: IpmnLFcAAAAJ
Alibaba DAMO Academy
Computer Vision
Vision-Language Models
OCR
Document Understanding
Scene Text Detection and Recognition
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
15,731
H-index
44
i10-index
64
Publications
20
Co-authors
9
list available
Contact
No contact links provided.
Publications
4 items
Spectral entropy prior-guided deep feature fusion architecture for magnetic core loss
2025
Cited
0
Generative Compositor for Few-Shot Visual Information Extraction
2025
Cited
0
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
arXiv.org · 2024
Cited
0
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
International Journal on Document Analysis and Recognition · 2022
Cited
18
Resume (English only)
Co-authors
9 total
Xiang Bai
Huazhong University of Science and Technology (HUST)
Co-author 2
Xinggang Wang
Professor, Huazhong University of Science and Technology
Zhuowen Tu
Professor, Cognitive Science, Computer Science&Engineering, UC San Diego
Wei Shen (沈为)
Professor, Shanghai Jiao Tong University
Yi Ma (马毅)
Director of School of Computing & Data Science, HKU; Visiting Professor of EECS, Berkeley
Yingying Zhu
PhD student, Dept. of EI, Huazhong University of Science and Technology
LONGIN JAN LATECKI
Professor of Computer Science, Temple University, Philadelphia
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up