Scholar

Jiuhai Chen

Google Scholar ID: eJP77eoAAAAJ

University of Maryland

MultimodalLarge Language Model

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,494

H-index

21

i10-index

26

Publications

20

Co-authors

6

list available

Contact

No contact links provided.

Publications

8 items

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

2025

Cited

0

BLIP3o-NEXT: Next Frontier of Native Image Generation

2025

Cited

0

Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

2025

Cited

0

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

2025

Cited

0

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

2025

Cited

0

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

2025

Cited

0

Transfer between Modalities with MetaQueries

2025

Cited

0

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

arXiv.org · 2024

Cited

20

Resume (English only)

Co-authors

6 total

Volpi-Cupal Professor of Computer Science, University of Maryland

Principal Research Scientist, Amazon Web Services

Courant Institute, New York University

Unknown affiliation