Hiroki Kanagawa

Google Scholar ID: PiG2uNEAAAAJ
NTT Inc.
speech synthesis, voice conversion, automatic speech recognition
Citations & Impact
All-time
  • Citations: 71
  • H-index: 4
  • i10-index: 3
  • Publications: 20
  • Co-authors: 6 (list available)
Resume (English only)
Academic Achievements
  • 2024: Hiroki Kanagawa, Takafumi Moriya, Yusuke Ijima, “Pre-training Neural Transducer-based Streaming Voice Conversion for Faster Convergence and Alignment-free Training,” Proc. Interspeech 2024, pp. 2755-2759, Aug. 2024 @ Kos, Greece.
  • 2024: Hiroki Kanagawa and Yusuke Ijima, “Knowledge Distillation from Self-Supervised Representation Learning Model with Discrete Speech Units for Any-to-Any Streaming Voice Conversion,” Proc. Interspeech 2024, pp. 4393-4397, Aug. 2024 @ Kos, Greece.
Research Experience
  • Dec. 2017 - present: Research Engineer, Human Insight Laboratory, NTT Human Informatics Laboratories, NTT Corporation, Japan
  • Apr. 2013 - Nov. 2017: Research Engineer, Information Technology R&D Center, Mitsubishi Electric Corporation, Japan
Education
  • M.E. in information processing from Tokyo Institute of Technology, 2013 (Supervisor: Takao Kobayashi)
  • B.E. in electronics engineering from The University of Electro-Communications, 2011 (Supervisor: Tetsuo Kirimoto)
Background
  • Research Interests: text-to-speech synthesis, voice conversion, neural vocoder, machine learning, automatic speech recognition
  • Professional Field: Information Processing and Electronics Engineering
  • Profile: Research Engineer at NTT Corporation
Miscellany
  • Programming Skills: C, C++ (skilled at high-performance implementation, e.g., using SIMD), C++/CLI, C#, Perl, Python (over 10 years), WPF, ASP.NET, JavaScript
  • Hobbies: bonsai, playing and watching baseball, watching anime