Jaeyeon Kim
Scholar

Jaeyeon Kim

Google Scholar ID: 2Yi8qMIAAAAJ
PhD Student, Carnegie Mellon University
Deep LearningSpeech Processing
Citations & Impact
All-time
Citations
64
 
H-index
4
 
i10-index
1
 
Publications
8
 
Co-authors
10
list available
Resume (English only)
Academic Achievements
  • - Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span (NeurIPS Spotlight, 2025)
  • - WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations (arXiv preprint arXiv:2508.20976, 2025)
  • - ViSAGe: Video-to-Spatial Audio Generation (ICLR, 2025)
  • - Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations (ICASSP, 2024)
  • - EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning (ICASSP, 2024)
Education
  • PhD student at the Language Technologies Institute, Carnegie Mellon University. Co-advised by Professor Carlos Busso and Professor Shinji Watanabe.
Background
  • Research Interests: Developing artificial intelligence that understands and interacts with the world in a human-like manner by integrating multiple modalities — particularly by bridging linguistic and visual knowledge with auditory information.
Miscellany
  • Feel free to contact me through jaeyeon2@andrew.cmu.edu!