Published 24 peer-reviewed papers across top venues such as ICLR, ACL, NAACL, WACV, NeurIPS, EMNLP, CVPR; received the Best Paper Award at the Foundation Models Meet Embodied Agents Workshop @ CVPR 2025.
Research Experience
Currently a Senior Research Scientist at Google Research, focusing on multimodal consistency; has mentored several MSc and PhD students towards their publication goals throughout his academic career.
Education
- PhD in Computer Science (Vision-and-Language), 2020-2023, The Hebrew University of Jerusalem, advised by Dr. Roy Schwartz and Dr. Gabriel Stanovsky.
- MSc in Computer Science (Natural Language Processing), Magna cum laude, 2018-2019, Ben Gurion University, supervised by Prof. Michael Elhadad and Prof. Eitan Bachmat.
- BSc in Computer Science, 2015-2018, Ben Gurion University.
Background
Research Interests: Multimodal consistency, improving large vision-and-language models, developing feedback models for text-to-image and text-to-video applications, multimodal factuality. Professional Field: Computer Science, with a focus on vision and language processing.