Scholar
Zirui Wang
Google Scholar ID: GgD-B68AAAAJ
Apple AI/ML
Deep Learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
11,182
H-index
19
i10-index
23
Publications
20
Co-authors
14
list available
Contact
Email
ziruiw@apple.com
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
1 items
Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation
2026
Cited
0
Resume (English only)
Academic Achievements
Post-training lead for Apple Intelligence Foundation Language Models
CoCa: Contrastive Captioners are Image-Text Foundation Models (TMLR 2022, co-first author)
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision (ICLR 2022)
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains (Arxiv 2024)
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities (Arxiv 2024)
Understanding Alignment in Multimodal LLMs: A Comprehensive Study (Arxiv 2024)
Ferret: Refer and Ground Anything Anywhere at Any Granularity (ICLR 2024)
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training (Arxiv 2024)
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training (Arxiv 2024)
REVEAL: Retrieval-Augmented Visual-Language Pre-Training With Multi-Source Multimodal Knowledge Memory (CVPR 2023)
Guiding Image Captioning Models Toward More Specific Captions (CVPR 2023)
Co-authors
14 total
Jiahui Yu
Research Scientist, OpenAI
Yonghui Wu
Head of Research, ByteDance Seed
Co-author 3
Adams Wei Yu
Research Scientist, Google DeepMind
Yulia Tsvetkov
University of Washington
Orhan Firat
Google AI
Co-author 7
Ruoming Pang
Apple AI/ML
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up