Mohan Shi
Scholar

Mohan Shi

Google Scholar ID: w-S1thkAAAAJ
University of California, Los Angeles | Ex-USTC
Speech RecognitionSpeech LLMMulti-modal LLMDeep Learning
Citations & Impact
All-time
Citations
71
 
H-index
6
 
i10-index
2
 
Publications
11
 
Co-authors
15
list available
Resume (English only)
Academic Achievements
  • - CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR, Interspeech 2025
  • - Advancing Multi-talker ASR Performance with Large Language Models, SLT 2024
  • - LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization, Interspeech 2024 (Oral)
  • - CASA-ASR: Context-Aware Speaker-Attributed ASR, Interspeech 2023
  • - Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction, Interspeech 2023 (Oral)
  • - A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings, APSIPA ASC 2023
  • - The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR, ASRU 2023
  • - Non-autoregressive End-to-End Speaker-Attributed ASR, ASRU 2023
  • - The USTC-NELSLIP offline speech translation systems for IWSLT 2022, IWSLT 2022
Research Experience
  • - Microsoft Research, Redmond, USA: Research Intern, CoreAI Speech Team, June 2025 – Sep 2025, Manager: Jinyu Li, Mentors: Xiong Xiao, Ruchao Fan, Shaoshi Ling
  • - Tencent AI Lab, Bellevue, USA (remote): Research Intern, Seattle Speech Lab, Sep 2023 – August 2024, Manager: Dong Yu, Mentors: Yong Xu, Shi-Xiong Zhang
  • - Alibaba Group, Hangzhou, China: Research Intern, Tongyi Speech Team, Jul 2022 – May 2023, Manager: Zhijie Yan, Mentors: Shiliang Zhang, Zhihao Du
Education
  • - University of California, Los Angeles (UCLA): Ph.D. student in Electrical and Computer Engineering, Sep 2024 – Present, Advisor: Prof. Abeer Alwan
  • - University of Science and Technology of China (USTC): Master of Engineering in Electronic Engineering and Information Science, Sep 2021 – Jun 2024, Advisor: Prof. Li-Rong Dai
  • - Dalian University of Technology: Bachelor of Engineering in Electronic Information Engineering, Sep 2017 – Jun 2021, GPA Rank: 1 / 185
Background
  • - Research Interests: Automatic Speech Recognition, Speech-centric Large Language Models, Child/Low-resource Speech Processing, Discrete Speech Tokens, Cocktail Party Problems
  • - Field: Electrical and Computer Engineering
  • - Brief Introduction: Ph.D. student at UCLA, advised by Prof. Abeer Alwan.