Scholar

Mohan Shi

Google Scholar ID: w-S1thkAAAAJ

University of California, Los Angeles | Ex-USTC

Speech RecognitionSpeech LLMMulti-modal LLMDeep Learning

Citations & Impact

All-time

Citations

H-index

i10-index

Publications

Co-authors

list available

Contact

Publications

4 items

2026

Cited

2026

Cited

2026

Cited

2025

Cited

Resume (English only)

Academic Achievements

- CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR, Interspeech 2025
- Advancing Multi-talker ASR Performance with Large Language Models, SLT 2024
- LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization, Interspeech 2024 (Oral)
- CASA-ASR: Context-Aware Speaker-Attributed ASR, Interspeech 2023
- Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction, Interspeech 2023 (Oral)
- A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings, APSIPA ASC 2023
- The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR, ASRU 2023
- Non-autoregressive End-to-End Speaker-Attributed ASR, ASRU 2023
- The USTC-NELSLIP offline speech translation systems for IWSLT 2022, IWSLT 2022

Research Experience

- Microsoft Research, Redmond, USA: Research Intern, CoreAI Speech Team, June 2025 – Sep 2025, Manager: Jinyu Li, Mentors: Xiong Xiao, Ruchao Fan, Shaoshi Ling
- Tencent AI Lab, Bellevue, USA (remote): Research Intern, Seattle Speech Lab, Sep 2023 – August 2024, Manager: Dong Yu, Mentors: Yong Xu, Shi-Xiong Zhang
- Alibaba Group, Hangzhou, China: Research Intern, Tongyi Speech Team, Jul 2022 – May 2023, Manager: Zhijie Yan, Mentors: Shiliang Zhang, Zhihao Du

Education

- University of California, Los Angeles (UCLA): Ph.D. student in Electrical and Computer Engineering, Sep 2024 – Present, Advisor: Prof. Abeer Alwan
- University of Science and Technology of China (USTC): Master of Engineering in Electronic Engineering and Information Science, Sep 2021 – Jun 2024, Advisor: Prof. Li-Rong Dai
- Dalian University of Technology: Bachelor of Engineering in Electronic Information Engineering, Sep 2017 – Jun 2021, GPA Rank: 1 / 185

Background

- Research Interests: Automatic Speech Recognition, Speech-centric Large Language Models, Child/Low-resource Speech Processing, Discrete Speech Tokens, Cocktail Party Problems
- Field: Electrical and Computer Engineering
- Brief Introduction: Ph.D. student at UCLA, advised by Prof. Abeer Alwan.

Co-authors

15 total