- CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR, Interspeech 2025
- Advancing Multi-talker ASR Performance with Large Language Models, SLT 2024
- LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization, Interspeech 2024 (Oral)
- The USTC-NELSLIP offline speech translation systems for IWSLT 2022, IWSLT 2022
Research Experience
- Microsoft Research, Redmond, USA: Research Intern, CoreAI Speech Team, June 2025 – Sep 2025, Manager: Jinyu Li, Mentors: Xiong Xiao, Ruchao Fan, Shaoshi Ling
- Tencent AI Lab, Bellevue, USA (remote): Research Intern, Seattle Speech Lab, Sep 2023 – August 2024, Manager: Dong Yu, Mentors: Yong Xu, Shi-Xiong Zhang
- Alibaba Group, Hangzhou, China: Research Intern, Tongyi Speech Team, Jul 2022 – May 2023, Manager: Zhijie Yan, Mentors: Shiliang Zhang, Zhihao Du
Education
- University of California, Los Angeles (UCLA): Ph.D. student in Electrical and Computer Engineering, Sep 2024 – Present, Advisor: Prof. Abeer Alwan
- University of Science and Technology of China (USTC): Master of Engineering in Electronic Engineering and Information Science, Sep 2021 – Jun 2024, Advisor: Prof. Li-Rong Dai
- Dalian University of Technology: Bachelor of Engineering in Electronic Information Engineering, Sep 2017 – Jun 2021, GPA Rank: 1 / 185
Background
- Research Interests: Automatic Speech Recognition, Speech-centric Large Language Models, Child/Low-resource Speech Processing, Discrete Speech Tokens, Cocktail Party Problems
- Field: Electrical and Computer Engineering
- Brief Introduction: Ph.D. student at UCLA, advised by Prof. Abeer Alwan.