Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Recipient of the 2023 Google Research PA Tech Impact Award
Manzano: A Simple and Scalable Unified Multimodal Model, first author (Launched at Google I/O 2024)
Imagen3: Google’s highest-quality text-to-image model, core contributor (Launched at Google I/O 2024)
Veo: Google’s most capable video generation model to date, core contributor (Launched at Google I/O 2024)
Published numerous papers at top-tier conferences including ICML, CVPR, ECCV, NeurIPS, and AAAI, covering video understanding, multimodal learning, self-supervised learning, and image deraining
Research Experience
Staff Research Scientist, Apple AI/ML Foundation Model Team
Research Scientist, Google
Research Intern, Google Research (May 2022 – Aug 2022); Hosted by Dr. Yin Cui, Dr. Boqing Gong, Dr. Tsung-Yi Lin, and Prof. Ming-Hsuan Yang
Research Intern, Bytedance AI Research (Mar 2019 – Jul 2019); Hosted by Dr. Ding Liu and Dr. Xiaohui Shen
Research Intern, Microsoft Research (Sept 2018 – Mar 2019); Hosted by Dr. Stephen Lin