Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Recently involved in two audio-visual generative works at Google Deepmind: Veo 2 (State-of-the-Art video generation) and Video-to-audio generation.
Research Experience
Currently a Research Scientist at Google Deepmind working on audio-visual generation. During his PhD, he interned at Google Research in Zurich, supervised by Zalan Borsos and Félix de Chaumont-Quitry, focusing on generative audio technologies; also interned at MILA, Montréal, supervised by Mirco Ravanelli, working on speech self-supervision evaluation and use within the SpeechBrain Library.
Education
PhD: Telecom Paris, supervised by Slim Essid and Titouan Parcollet, graduated in March 2024; Master's degree: ENS Paris-Saclay (MVA Program), major in Machine Learning; Bachelor's degree: Ecole Polytechnique, major in Applied Mathematics and Computer Science.
Background
Research interests: Machine Learning applied to Language, Speech, and Audio. Current work focuses on video-conditioned audio generation.
Miscellany
Contributed to the SpeechBrain library, recommending it for beginners or those who are fed up with their current deep learning for speech framework.