Salah Zaiem
Scholar

Salah Zaiem

Google Scholar ID: hVXGK5wAAAAJ
GoogleDeepmind
Audio & Speech Processing
Citations & Impact
All-time
Citations
813
 
H-index
13
 
i10-index
13
 
Publications
20
 
Co-authors
0
 
Publications
20 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Recently involved in two audio-visual generative works at Google Deepmind: Veo 2 (State-of-the-Art video generation) and Video-to-audio generation.
Research Experience
  • Currently a Research Scientist at Google Deepmind working on audio-visual generation. During his PhD, he interned at Google Research in Zurich, supervised by Zalan Borsos and Félix de Chaumont-Quitry, focusing on generative audio technologies; also interned at MILA, Montréal, supervised by Mirco Ravanelli, working on speech self-supervision evaluation and use within the SpeechBrain Library.
Education
  • PhD: Telecom Paris, supervised by Slim Essid and Titouan Parcollet, graduated in March 2024; Master's degree: ENS Paris-Saclay (MVA Program), major in Machine Learning; Bachelor's degree: Ecole Polytechnique, major in Applied Mathematics and Computer Science.
Background
  • Research interests: Machine Learning applied to Language, Speech, and Audio. Current work focuses on video-conditioned audio generation.
Miscellany
  • Contributed to the SpeechBrain library, recommending it for beginners or those who are fed up with their current deep learning for speech framework.
Co-authors
0 total
Co-authors: 0 (list not available)