Scholar

Salah Zaiem

Google Scholar ID: hVXGK5wAAAAJ

Google DeepMind

Audio & Speech Processing

Citations & Impact

All-time

Citations

813

Publications

20 items

Resume (English only)

Academic Achievements

Recently contributed to two audio-visual generative projects at Google DeepMind: Veo 2 (state-of-the-art video generation) and video-to-audio generation.

Research Experience

Currently a Research Scientist at Google DeepMind working on audio-visual generation. During his PhD, he interned at Google Research in Zurich, supervised by Zalan Borsos and Félix de Chaumont-Quitry, where he focused on generative audio technologies. He also interned at MILA in Montréal, supervised by Mirco Ravanelli, where he worked on evaluating and using speech self-supervision within the SpeechBrain library.

Education

PhD: Telecom Paris, supervised by Slim Essid and Titouan Parcollet, graduated in March 2024; Master's degree: ENS Paris-Saclay (MVA Program), major in Machine Learning; Bachelor's degree: Ecole Polytechnique, major in Applied Mathematics and Computer Science.

Background

Research interests: Machine Learning applied to Language, Speech, and Audio. Current work focuses on video-conditioned audio generation.

Miscellany

Contributed to the SpeechBrain library, which he recommends to beginners and to anyone fed up with their current deep learning framework for speech.

Co-authors

0 total