Scholar

Chunyuan Li

Google Scholar ID: Zd7WmXUAAAAJ

xAI

Deep Learning · Vision · Language · Multimodal


Citations & Impact

All-time

Citations

47,672

H-index

i10-index

136

Publications

Co-authors

list available

Contact

Email: lichunyuan24@gmail.com

Publications

10 items

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

2026


UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

2026


OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

2026


LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

2025


LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

2025


Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning

2025


Video Instruction Tuning With Synthetic Data

arXiv.org · 2024

Cited: 248

LLaVA-Critic: Learning to Evaluate Multimodal Models

arXiv.org · 2024


Resume (English only)

Academic Achievements

Contributed to the LLaVA series, including LLaVA-1.5 (SoTA on 11 open-source VLM benchmarks), LLaVA-NeXT, LLaVA-OneVision, LLaVA-Video, LLaVA-Critic, LLaVA-Med (NeurIPS 2023 Datasets and Benchmarks Track Spotlight), LLaVA-Interactive, and LLaVA-Plus.
Developed the proprietary industry-leading VLM Seed-VL-1.5 for image and video understanding.
Published numerous high-impact papers at NeurIPS (Oral/Spotlight), CVPR (Highlights), ECCV, and a survey in Foundations and Trends® in Computer Graphics and Vision.
Notable projects include REACT (CVPR 2023), GLIGEN (CVPR 2023), X-Decoder, K-LITE (NeurIPS 2022 Oral), ELEVATER, and FocalNet.
Authored a 110-page perspective paper 'Multimodal Foundation Models: From Specialists to General-Purpose Assistants' and delivered the CVPR 2023 tutorial on the topic.
Served as Area Chair for NeurIPS, ICML, ICLR, EMNLP, and TMLR, and as Guest Editor for an IJCV special issue on 'the promises and dangers of large vision models'.

Co-authors

16 total