Scholar
Orr Zohar
Google Scholar ID: Jjw4rL0AAAAJ
Stanford University
Large Multimodal Models
Foundation Models
Vision-Language Models
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,290
H-index
15
i10-index
18
Publications
20
Co-authors
7
list available
Contact
No contact links provided.
Publications
6 items
ViTok-v2: Scaling Native Resolution Auto-Encoders to 5 Billion Parameters
2026
Cited
0
The Impact of Image Resolution on Biomedical Multimodal Large Language Models
2025
Cited
0
FineVision: Open Data Is All You Need
2025
Cited
0
SmolVLM: Redefining small and efficient multimodal models
2025
Cited
2
A Large-Scale Vision-Language Dataset Derived from Open Scientific Literature to Advance Biomedical Generalist AI
2025
Cited
0
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
2025
Cited
0
Resume (English only)
Co-authors
7 total
Serena Yeung-Levy
Stanford University
Xiaohan Wang
Stanford University
Yuhui Zhang
Stanford University
Kuan-Chieh Wang
Snap Inc.
Co-author 5
Yonatan Bitton
Research Scientist, Google
Idan Szpektor
Google Research
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up