- Hiformer: Sequence Modeling Networks with Hierarchical Attention Mechanisms, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
- Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors, IEEE Transactions on Affective Computing, 2022
- Exemplar-based Emotive Speech Synthesis, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021
- Speech Emotion Recognition Using Sequential Capsule Networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021
- Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021
- Intonation classification for L2 English speech using multi-distribution deep neural networks, Computer Speech & Language, 2016
- Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks, ACM MM'22
- Neural Architecture Search for Speech Emotion Recognition, ICASSP'22
- Ensemble Approaches for Uncertainty in Spoken Language Assessment, Interspeech'20
- Speech Emotion Recognition Using Capsule Networks, ICASSP'19
- Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis, Interspeech'18
- Feature based Adaptation for Speaking Style Synthesis, ICASSP'18
Research Experience
- Assistant Professor, Department of Systems Engineering and Engineering Management, CUHK
- Research Associate in the Speech Group of the Machine Intelligence Laboratory, Engineering Department of University of Cambridge, supervised by Prof. Mark Gales and Dr. Kate Knill
Education
- Ph.D. from CUHK, supervised by Prof. Helen Meng
- M.S. from Tsinghua University, supervised by Prof. Zhiyong Wu
Background
Currently an Assistant Professor at the Department of Systems Engineering and Engineering Management, CUHK. Research interests include speech synthesis and recognition, affective computing, and neural network uncertainty.