Published multiple papers such as 'Audio-FLAN: An Instruction-Following Dataset for Unified Understanding and Generation of Speech, Music, and Sound' and more. Involved in several projects, including the development of the Amphion open-source platform.
Research Experience
Served as a research intern at Microsoft (2019.04 - 2020.06, 2021.11 - 2022.10), Tencent AI Lab (2021.06 - 2021.11), and JD.COM AI Lab (2018.10 - 2019.04).
Education
Received Ph.D. degree from the Audio, Speech and Language Processing Laboratory at Northwestern Polytechnical University, supervised by Prof. Lei Xie.
Background
Currently a Postdoctoral Researcher at Hong Kong University of Science and Technology, working with Prof. Yike Guo and Prof. Wei Xue. Research interests include audio, speech and language processing, as well as audio, music, and speech generation.
Miscellany
Co-founder of Amphion, an open-source platform for Audio, Music, and Speech Generation.