Scholar

Orr Zohar

Google Scholar ID: Jjw4rL0AAAAJ

Stanford University

Large Multimodal ModelsFoundation ModelsVision-Language Models

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,290

H-index

15

i10-index

18

Publications

20

Co-authors

7

list available

Contact

No contact links provided.

Publications

6 items

ViTok-v2: Scaling Native Resolution Auto-Encoders to 5 Billion Parameters

2026

Cited

0

The Impact of Image Resolution on Biomedical Multimodal Large Language Models

2025

Cited

0

FineVision: Open Data Is All You Need

2025

Cited

0

SmolVLM: Redefining small and efficient multimodal models

2025

Cited

2

A Large-Scale Vision-Language Dataset Derived from Open Scientific Literature to Advance Biomedical Generalist AI

2025

Cited

0

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

2025

Cited

0

Resume (English only)

Co-authors

7 total

Serena Yeung-Levy

Stanford University

Stanford University

Stanford University

Kuan-Chieh Wang

Research Scientist, Google

Google Research