Shuai Wang
Scholar

Shuai Wang

Google Scholar ID: vW1ZaucAAAAJ
Nanjing University
speaker recognitiondeep learningspeech processing
Citations & Impact
All-time
Citations
2,715
 
H-index
29
 
i10-index
51
 
Publications
20
 
Co-authors
18
list available
Publications
20 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Published more than 60 papers at top-tier speech conferences/journals; involved in multiple open-source projects such as WeSpeaker (a comprehensive speaker embedding learning toolkit), WeSep (the first open-source target speaker extraction toolkit), DiffRhythm (diffusion-based rhythmic music generation), SongBloom (autoregressive diffusion-based music generation).
Research Experience
  • Prior to joining Nanjing University, served as a research scientist in Prof. Haizhou Li's team at the Shenzhen Research Institute of Big Data, Chinese University of Hong Kong (Shenzhen), where he still holds an adjunct position now. Additionally, spent 2.5 years as a senior research scientist at Lightspeed & Quantum Studios, Tencent, leading the speech group in R&D of speech technologies customized for games.
Education
  • Earned B.E. degree from Northwestern Polytechnical University in 2014 under the supervision of Prof. Lei Xie; Ph.D. degree from Shanghai Jiao Tong University in 2020, supervised by Prof. Kai Yu and Prof. Yanmin Qian.
Background
  • Research interests include speaker modeling, target speaker processing, speech synthesis, voice conversion, and music generation. Published over 60 papers in top-tier speech conferences/journals.
Miscellany
  • Currently looking for graduate students and research assistants interested in specific research areas; welcomes sophomore and junior students from Nanjing University for internships; NJU students can drop by Room 536 at Nanyong Building for face-to-face discussion.