Scholar
Maoyuan Ye
Google Scholar ID: Xy8cr_4AAAAJ
Wuhan University
CV
OCR
LLM
MLLM
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
281
H-index
5
i10-index
3
Publications
9
Co-authors
7
list available
Contact
Email
yemaoyuan@whu.edu.cn
GitHub
Open ↗
Publications
5 items
ET-SAM: Efficient Point Prompt Prediction in SAM for Unified Scene Text Detection and Layout Analysis
2026
Cited
0
Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation
2025
Cited
0
GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking
2025
Cited
0
Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?
2025
Cited
0
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
2025
Cited
0
Resume (English only)
Academic Achievements
- Publications:
- IEEE TPAMI: Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
- NeurIPS 2024: GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
- CVPR 2023: DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
- AAAI 2023: DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
- Preprints:
- arxiv preprint: LogicOCR, Reasoning-OCR, Adapting Segment Anything Model for Power Transmission Corridor Hazard Segmentation, GoMatching++, DeepSolo++
- Awards: Not explicitly mentioned
Research Experience
- Work Experience: Interned at JD Explore Academy and iFLYTEK Research
- Research Projects: Involved in multiple projects related to OCR and video text spotting
Education
- Degree: Ph.D.
- University: Wuhan University
- Advisors: Prof. Bo Du, Prof. Juhua Liu
- Time: Currently enrolled
- Major: Computer Science
Background
- Research Interests: Computer Vision, Large Language Models, Multimodal Large Language Models
- Professional Field: OCR-related topics, now focusing on Multimodal Large Language Models
- Introduction: First-year Ph.D. student at the School of Computer Science, Wuhan University
Miscellany
- Personal Interests: Not explicitly mentioned
- Other: Served as a reviewer for several international conferences and journals
Co-authors
7 total
bo du
School of Computer Science, Wuhan University
Juhua Liu(刘菊华)
Wuhan University
Dacheng Tao
Nanyang Technological University
Jing Zhang (张敬)
Wuhan University, previously at The University of Sydney
Shanshan Zhao
Alibaba
Chenyu Liu
iFLYTEK Research, USTC
贺海斌
武汉大学
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up