Feng Yang
Scholar

Feng Yang

Google Scholar ID: XI8oQn8AAAAJ
Principal Engineer (Director), GenAI, Google DeepMind
LLMMultimediaAIMultimodal GenerationComputer Vision
Citations & Impact
All-time
Citations
10,715
 
H-index
23
 
i10-index
30
 
Publications
20
 
Co-authors
56
list available
Contact
No contact links provided.
Publications
20 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Published 54 research articles (including 23 CVPR, ICCV, ECCV, NeurIPS, TIP papers). Won CVPR 2024 best paper award, CVPR 2022 best paper finalist. Holds 50+ patents, some of which were sold to Rambus Inc. Co-initiated many projects, served as tech lead for 130+ launches with Ads, YouTube, Cloud, Search, Photos, Pixel, Commerce, Play, etc. These launches significantly boosted core metrics, for example, increased Google revenue by ~$1B/year and enabled another $xB/year, won 2 Google Tech Impact Awards, 1 Google DeepMind Tech Impact Award, 2 Google Research Tech Awards, 3 Ads Tech Impact Awards, and 1 SAGE award, 20 Perfy Awards including 2 Editor's Choice and 2 Golden.
Research Experience
  • Currently a Principal Scientist (Director) at GenAI, Google DeepMind, leading a team working on research and productionization of Gemini, Imagen, and Veo. Previously, he was a postdoctoral researcher at Illumination&Imaging Lab, Robotics Institute, CMU, supervised by Prof. Srinivasa Narasimhan. He was a Research Assistant with Audiovisual Communications Laboratory, EPFL, advised by Prof. Martin Vetterli, and the Broadband Network and Digital Multimedia Laboratory, Tsinghua University, advised by Prof. Qionghai Dai.
Education
  • Received B.Eng. and M.Eng. degrees in automatic control from Tsinghua University, Beijing, China, in 2004 and 2007, respectively. Obtained Ph.D. degree in communication systems at the École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland in 2012. Won Outstanding master thesis of Tsinghua University and Fritz Kutter Award for best PhD thesis.
Background
  • Research interests include LLM & VLM, multimodal understanding & generation, responsible AI, computer vision, multimedia processing and communications.
Miscellany
  • During his time at Google, most of the images/videos served by Google are processed by one or more algorithms developed by Feng and his team. He also serves as Senior Associate Editor, Area Chair, Associate Editor, Co-Organizer, Industry Chair, Publicity Chair, and PC Member for various academic journals and conferences.