Scholar
Jiaxuan Liu
Google Scholar ID: 19Cg4EcAAAAJ
University of Science and Technology of China
Text-to-Speech
Speech LLM
AGI
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
3
H-index
1
i10-index
0
Publications
2
Co-authors
1
list available
Contact
No contact links provided.
Publications
9 items
Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
2026
Cited
0
PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks
2026
Cited
0
ERNIE 5.0 Technical Report
2026
Cited
0
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
2026
Cited
0
FunCineForge: A Unified Dataset Toolkit and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes
2026
Cited
0
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
2025
Cited
0
PaddleOCR 3.0 Technical Report
2025
Cited
0
UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech
2025
Cited
0
Load more
Resume (English only)
Co-authors
1 total
Zhen-Hua Ling(凌震华)
Professor, University of Science and Technology of China
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up