Wanrong Zhu

Google Scholar ID: xNWgry0AAAAJ
Adobe Research
Vision and Language, Natural Language Processing
Citations & Impact (all-time)
  • Citations: 2,698
  • H-index: 19
  • i10-index: 22
  • Publications: 20
  • Co-authors: 17
Resume (English only)
Academic Achievements
  • List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
    An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Julian McAuley, Jianfeng Gao, Lijuan Wang
    The First Conference on Language Modeling (CoLM 2024)
  • VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
    Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang
    The Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2024)
  • OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
    Anas Awadalla, Irena Gao, Josh Gardner, Jack Hessel, Yusuf Hanafy, Wanrong Zhu, Kalyani Marathe, Yonatan Bitton, Samir Gadre, Shiori Sagawa, Jenia Jitsev, Simon Kornblith, Pang Wei Koh, Gabriel Ilharco, Mitchell Wortsman, Ludwig Schmidt
    Technical Report
  • Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text
    Wanrong Zhu*, Jack Hessel*, Anas Awadalla, Samir Yitzhak Gadre, Jesse Dodge, Alex Fang, Youngjae Yu, Ludwig Schmidt, William Yang Wang, Yejin Choi
    The Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS D&B 2023)
  • VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
    Yonatan Bitton*, Hritik Bansal*, Jack Hessel*, Rulin Shao, Wanrong Zhu, Anas Awadalla, Josh Gardner, Rohan Taori, Ludwig Schmidt
    The Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track (NeurIPS D&B 2023)
  • LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
    Weixi Feng*, Wanrong Zhu*, Tsu-Jui Fu, Varun Jampani, Arjun Reddy Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang
    The Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
  • Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning
    Xinyi Wang, Wanrong Zhu, Michael Saxon, Mark Steyvers, William Yang Wang
    The Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
  • Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
    Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
    The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023, Short)
  • Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
    Wanrong Zhu, An Yan, Yujie Lu, Wenda Xu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
    The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023, Short)
Research Experience
  • Research Intern, Adobe Research (June 2023 - Sep. 2023). Hosts: Jennifer Healey and Ruiyi Zhang
  • Research Intern, AI2 Mosaic (June 2022 - Sep. 2022). Hosts: Jack Hessel and Youngjae Yu
  • Research Intern, Google Research (June 2021 - Oct. 2021). Hosts: Bo Pang and Ashish Thapliyal
  • Research Intern, Google Ads (June 2020 - Oct. 2020). Host: Pradyumna Narayana
  • Research Assistant, Language Technologies Institute, Carnegie Mellon University (July 2018 - Sep. 2018). Advisor: Zhiting Hu
Background
  • Research interests: multimodal learning, particularly vision-and-language and text generation.
Miscellany
  • Named a 2023 Rising Star in Machine Learning by the University of Maryland.