- GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement (Submitted to NIPS 2024)
- Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems (ISCA Interspeech 2023, Dublin, Ireland, Oral Presentation)
- Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems (ISCA Interspeech 2023, Dublin, Ireland, Oral Presentation)
- A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One (IEEE ICASSP2023, Rhodes Island, Greece, Oral Presentation)
- Two-pass decoding and cross-adaptation based system combination of end-to-end conformer and hybrid tdnn asr systems (ISCA Interspeech 2022, Incheon, Korea)
Research Experience
- 2024.06 - Now: Research Intern, Noah’s Ark Lab, Hong Kong SAR, China
- 2023.10 - 2024.02: Remote Research Intern, Speech Lab, Alibaba DAMO Academy, China
- 2022.03 - 2023.09: Research Intern, International Digital Economy Academy (IDEA), China
- 2020.08 - 2021.09: Research Assistant, The Chinese University of Hong Kong (CUHK), China
Education
- Since 2021.09: Ph.D. student, The Chinese University of Hong Kong (CUHK), China, Supervisors: Prof. LIU Xunying and Prof. CHEN Xie
- 2019.09 - 2020.06: Master of Computer Science, The Chinese University of Hong Kong (CUHK), China
- 2015.06 - 2019.07: Bachelor of Software Engineering, SouthEast University (SEU), China
Background
- Research Interests: Long-context ASR, multimodal LLM, and streaming LLM
- Professional Field: Systems Engineering and Engineering Management
- Brief Introduction: Currently a Ph.D. student at the Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong (CUHK), supervised by Prof. LIU Xunying and co-supervised by Prof. CHEN Xie.
Miscellany
- Teaching Assistance: ENGG 1120 C, Linear Algebra; FTEC 4006, Internet Finance; SEEM 2420, Operations Research I