Nithin Rao Koluguri

Google Scholar ID: YPtcUTUAAAAJ
NVIDIA Corporation
Speech Processing, Deep Neural Networks, Machine Learning
Citations & Impact
All-time
  • Citations: 768
  • H-index: 14
  • i10-index: 16
  • Publications: 20
  • Co-authors: 14
Resume
Academic Achievements
  • Created the TitaNet architecture for speaker recognition, which is widely adopted, with ~1.5M monthly downloads on Hugging Face
  • Co-built the first speaker diarization modules in NeMo using TitaNet embeddings and NME-SC clustering
  • Led billion-parameter scaling of FastConformer ASR models to advance speech recognition capabilities
  • Leading development of the Parakeet model series; parakeet-tdt-0.6b-v2 currently ranks #1 on the Hugging Face Open-ASR leaderboard
  • Co-author of 'Sortformer: Seamless integration of speaker diarization and ASR by bridging timestamps and tokens'
  • Developed speech classifiers at IISc for ALS and Parkinson’s disease detection, using voice as a biomarker
  • During research at USC, improved ASR by generating n-best path lists under various noise conditions using Kaldi