Published several papers including 'Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities' and 'Imagen 3'; received the NTU SCSE Outstanding PhD Thesis Award (2023); recipient of the Google PhD Fellowship (2021).
Research Experience
Currently a Senior Research Scientist at Google DeepMind; served as an Area Chair for multiple international conferences such as CVPR, ICLR, NeurIPS, and ICML.
Education
Earned B.Sc. and B.Eng. degrees from The Chinese University of Hong Kong; M.Phil. degree in Mathematics from the same university; Ph.D. in Computer Science from Nanyang Technological University, supervised by Prof. Chen Change Loy.
Background
Senior Research Scientist specializing in multimodal models, with a focus on image generation and editing. He is a core contributor to Google's Gemini 2.5 Flash Image (Nano Banana), Imagen, and Gemini 2.0 Flash Native Image Generation.