Published 52 papers in top-tier speech conferences and journals, such as TASLP, SPM, ICASSP, and Interspeech. Recipient of the ASRU 2019 Best Paper Award, EMNLP 2024 Best Paper Award, and MSRA 2021 Fellowship. Serves as a regular reviewer for top-tier conferences and journals, including TASLP, SPL, Speech Communication, ICASSP, and Interspeech. Received the Best Reviewer Award at ASRU 2023. Actively contributes to open-source projects, especially ESPnet (one of the most popular end-to-end speech processing toolkits), where he serves as one of the core maintainers.
Research Experience
Member of the AudioCC Lab led by Prof. Yanmin Qian.
Education
Received his Ph.D. degree from Shanghai Jiao Tong University in 2024 and his B.Sc. degree from Huazhong University of Science and Technology in 2018. Advisor: Prof. Yanmin Qian.
Background
Research interests: Speech signal processing in complex scenarios, including speech enhancement, separation, recognition, and self-supervised speech pre-training. Also highly interested in anything that helps to better understand AI.