Academic service: [2024] Organized workshop on Speech and Audio Language Models (SALMA) at ICASSP 2025; [2023] Organized special session at ICASSP 2023; [Reviewer] ICASSP, INTERSPEECH, NeurIPS, ICLR, DCASE, TASLP
Research Experience
Senior Applied Scientist on the Microsoft Speech team. Recent works include Video Translation, Pengi, CLAP.
Education
PhD: Carnegie Mellon University; B.Tech: VJTI
Background
Broad research interests include Audio/Speech Processing and Multimodal Learning. Research gets deployed in products like Teams, Edge, Outlook.
Miscellany
Links: Google Scholar, GitHub, Twitter, LinkedIn, CV