Scholar

Chia-Wen Kuo

Google Scholar ID: iip65VkAAAAJ

ByteDance US

MultimodalVision and Language

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,776

H-index

13

i10-index

14

Publications

20

Co-authors

3

list available

Contact

No contact links provided.

Publications

5 items

VEBench:Benchmarking Large Multimodal Models for Real-World Video Editing

2026

Cited

0

Vidi2: Large Multimodal Models for Video Understanding and Creation

2025

Cited

0

Vidi: Large Multimodal Models for Video Understanding and Editing

2025

Cited

0

Where do Large Vision-Language Models Look at when Answering Questions?

2025

Cited

0

Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models

2025

Cited

0

Resume (English only)

Co-authors

3 total

Associate Professor, Georgia Institute of Technology

Member of Technical Staff @ Microsoft AI

Research Scientist, Meta