Scholar
Yongxin Shi
Google Scholar ID: e-3XAoAAAAAJ
South China University of Technology
Computer Vision
OCR
Multimodal LLMs
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
144
H-index
6
i10-index
3
Publications
15
Co-authors
0
Contact
No contact links provided.
Publications
4 items
URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding
2025
Cited
0
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration
2025
Cited
0
OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
2025
Cited
0
MegaHan97K: A large-scale dataset for mega-category Chinese character recognition with over 97K categories
Pattern Recognition · 2025
Cited
1
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up