Scholar
Yun Zheng
Google Scholar ID: -hFpScAAAAAJ
Alibaba
Computer Vision
Multimodal Modeling
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
1,670
H-index
15
i10-index
22
Publications
20
Co-authors
24
list available
Contact
No contact links provided.
Publications
12 items
AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation
2026
Cited
0
GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning
2026
Cited
1
ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement
2025
Cited
0
UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing
2025
Cited
0
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
2025
Cited
0
ChronoTailor: Harnessing Attention Guidance for Fine-Grained Video Virtual Try-On
2025
Cited
0
Aligned Better, Listen Better for Audio-Visual Large Language Models
2025
Cited
0
Wan: Open and Advanced Large-Scale Video Generative Models
2025
Cited
1
Load more
Resume (English only)
Co-authors
24 total
Yanhao Zhang
Alibaba Damo Academy, OPPO AI Center
Siyang Sun
Alibaba Group
Chen-Wei Xie
Alibaba Group
Qiang Wang
Apple
Pandeng Li
University of Science and Technology of China && Alibaba Tongyi
Deli Zhao
Alibaba DAMO Academy
Rong Jin
Alibaba Group
xiaoyi bao
CASIA
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up