Scholar
Gangyan Zeng
Google Scholar ID: dMHfDZYAAAAJ
Nanjing University of Science and Technology
Computer Vision
OCR
Multimodal Learning
Citations & Impact
All-time
Citations
211
H-index
8
i10-index
7
Publications
20
Co-authors
2
Publications
7 items
Towards Training-Free Scene Text Editing
2026
Cited
0
Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training
2026
Cited
0
MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation
2026
Cited
0
Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark
2025
Cited
0
Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective
2025
Cited
0
When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
2025
Cited
0
VidText: Towards Comprehensive Evaluation for Video Text Understanding
2025
Cited
0
Co-authors
2 total
Yu ZHOU (周宇)
Nankai University, China
Xugong Qin
Nanjing University of Science and Technology