Chia-Wen Kuo
Google Scholar ID: iip65VkAAAAJ
ByteDance US
Multimodal
Vision and Language
Citations & Impact (all-time)
Citations: 1,729
H-index: 12
i10-index: 12
Publications: 20
Co-authors: 3
Publications
4 items
Vidi2: Large Multimodal Models for Video Understanding and Creation (2025). Citations: 0
Vidi: Large Multimodal Models for Video Understanding and Editing (2025). Citations: 0
Where do Large Vision-Language Models Look at when Answering Questions? (2025). Citations: 0
Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models (2025). Citations: 0
Co-authors
3 total
Zsolt Kira, Associate Professor, Georgia Institute of Technology
Chih-Yao Ma, Member of Technical Staff @ Microsoft AI
Yen-Cheng Liu, Research Scientist, Meta