Satvik Dixit
Google Scholar ID: fO8a44AAAAAJ
Carnegie Mellon University
Speech and Audio, Large Language Models
Citations & Impact
All-time
Citations: 23
H-index: 2
i10-index: 0
Publications: 7
Co-authors: 4 (list available)
Resume (English only)
Academic Achievements
  • Mellow: a small audio language model for reasoning (NeurIPS 2025)
  • MACE: Leveraging Audio for Evaluating Audio Captioning Systems (ICASSP 2025 SALMA Workshop)
  • Vision Language Models Are Few-Shot Audio Spectrogram Classifiers (NeurIPS 2024 Audio Imagination Workshop)
Research Experience
  • Worked with Professor Chris Donahue on Generative Audio
  • Worked with Professor Bhiksha Raj on Audio Language Models
  • Interned with Dr. Satrajit Ghosh at MIT
  • Interned with Dr. Martin Vetterli at EPFL
Education
  • Undergraduate degree in Electrical Engineering from IIT Delhi, with a concentration in signal processing and ML.
Background
  • Master's student at Carnegie Mellon University, interested in audio understanding and generation.
Miscellany
  • Email: satvikdixit@cmu.edu
  • Google Scholar: Google Scholar
  • LinkedIn: LinkedIn