Hangdi Xing
Google Scholar ID: 27tv_CEAAAAJ
Student, Zhejiang University
Document Understanding
Vision-Language Models
Citations & Impact (all-time)
Citations: 67
H-index: 4
i10-index: 2
Publications: 11
Co-authors: 0
Contact
No contact links provided.
Publications (4 items)
BrailleLLM: Braille Instruction Tuning with Large Language Models for Braille Domain Tasks (2025). Cited: 0
Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning (2025). Cited: 0
A Simple yet Effective Layout Token in Large Language Models for Document Understanding (2025). Cited: 0
Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding (arXiv.org, 2024). Cited: 0