Yi-Chiao, Wu

Google Scholar ID: KKaOQVwAAAAJ
Meta
Speech/Audio Codec, Speech/Audio Generation, Speech/Audio Evaluation
Citations & Impact
All-time
  • - Citations: 2,624
  • - H-index: 20
  • - i10-index: 33
  • - Publications: 20
  • - Co-authors: 32
Resume (English only)
Academic Achievements
  • - FlowDec: A Flow-based Full-band General Audio Codec with High Perceptual Quality, ICLR, 2025
  • - ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling, ICASSP, 2025
  • - ScoreDec: A Phase-Preserving High-Fidelity Audio Codec with a Generalized Score-Based Diffusion Post-Filter, ICASSP, 2024
  • - AudioDec: An Open-Source Streaming High-Fidelity Neural Audio Codec, ICASSP, 2023
  • - A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System, APSIPA Trans., 2022
  • - Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder, Interspeech, 2021
  • - Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network, IEEE TASLP, 2021
  • - Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network, IEEE TASLP, 2021
  • - Quasi-Periodic parallel WaveGAN vocoder: a non-autoregressive pitch-dependent dilated convolution model for parametric speech generation, Interspeech, 2020
  • - A cyclical post-filtering approach to mismatch refinement of neural vocoder for text-to-speech systems, Interspeech, 2020
Research Experience
  • - Research Scientist, FAIR, Meta, NYC, US (Oct. 2023 - present)
  • - Research Scientist, Codec Avatars Lab, Meta, NYC, US (Jan. 2022 - Sep. 2023)
  • - Postdoc Researcher, Academia Sinica, Taiwan (Oct. 2021 - Dec. 2021), Advisors: Hsin-Min Wang, Yu Tsao
  • - Research Assistant, Nagoya University, Japan (Oct. 2017 - Sep. 2021), Advisor: Tomoki Toda
  • - Summer Intern, National Institute of Information and Communications Technology, Japan (Oct. 2019 summer)
  • - Research Assistant, Academia Sinica, Taiwan (Oct. 2015 - Sep. 2017), Advisors: Hsin-Min Wang, Yu Tsao
  • - Software R&D Engineer, ASUS, Taiwan (Oct. 2013 - Mar. 2015)
  • - System Designer, Realtek, Taiwan (Mar. 2012 - Oct. 2013)
  • - Research Assistant, National Chiao Tung University, Taiwan (Sep. 2009 - Dec. 2011), Advisors: Sin-Horng Chen, Yih-Ru Wang
Education
  • - Ph.D. (2021), Graduate School of Informatics, Nagoya University, Advisor: Tomoki Toda
  • - M.S. (2011), School of Communication Engineering, National Chiao Tung University, Advisors: Sin-Horng Chen, Yih-Ru Wang
  • - B.S. (2009), School of Communication Engineering, National Chiao Tung University
Background
  • Currently a research scientist at Meta FAIR. Research interests include speech generation applications based on machine learning methods, such as neural vocoders, voice conversion, speech enhancement, and speech bandwidth expansion.
Miscellany
  • Contact:
  • - Email: yichiao.wu@g.sp.m.is.nagoya-u.ac.jp, yichiaowu@meta.com
  • - GitHub, Google Scholar, ResearchGate, ResearchMap, Web of Science, LinkedIn, YouTube, Medium