Soham Deshmukh
Scholar

Soham Deshmukh

Google Scholar ID: MasiEogAAAAJ
Microsoft, Carnegie Mellon University
Audio machine learningAudio processingSpeech processing
Citations & Impact
All-time
Citations
1,557
 
H-index
13
 
i10-index
18
 
Publications
20
 
Co-authors
15
list available
Resume (English only)
Academic Achievements
  • Academic service: [2024] Organized workshop on Speech and Audio Language Models (SALMA) at ICASSP 2025; [2023] Organized special session at ICASSP 2023; [Reviewer] ICASSP, INTERSPEECH, NeurIPS, ICLR, DCASE, TASLP
Research Experience
  • Senior Applied Scientist on the Microsoft Speech team. Recent works include Video Translation, Pengi, CLAP.
Education
  • PhD: Carnegie Mellon University; B.Tech: VJTI
Background
  • Broad research interests include Audio/Speech Processing and Multimodal Learning. Research gets deployed in products like Teams, Edge, Outlook.
Miscellany
  • Links: Google Scholar, GitHub, Twitter, LinkedIn, CV