Scholar

Hiroki Kanagawa

Google Scholar ID: PiG2uNEAAAAJ

NTT Inc.

speech synthesisvoice conversionautomatic speech recognition

Citations & Impact

All-time

Citations

H-index

i10-index

Publications

Co-authors

list available

Contact

Publications

1 items

2025

Cited

Resume (English only)

Academic Achievements

- 2024: Hiroki Kanagawa, Takafumi Moriya, Yusuke Ijima, “Pre-training Neural Transducer-based Streaming Voice Conversion for Faster Convergence and Alignment-free Training,” Proc. Interspeech 2024, pp. 2755-2759, Aug. 2024 @ Kos, Greece.
- 2024: Hiroki Kanagawa and Yusuke Ijima, “Knowledge Distillation from Self-Supervised Representation Learning Model with Discrete Speech Units for Any-to-Any Streaming Voice Conversion,” Proc. Interspeech 2024, pp. 4393-4397, Aug. 2024 @ Kos, Greece.

Research Experience

- Dec, 2017 - Now: Research Engineer, Human Insight Laboratory, NTT Human Informatics Laboratories, NTT Corporation, Japan
- Apr, 2013 - Nov, 2017: Research engineer at Information Technology R&D Center, Mitsubishi Electric Corporation, Japan

Education

- M.E. in information processing from Tokyo Institute of Technology, 2013 (Supervisor: Takao Kobayashi)
- B.E. in electronics engineering from The University of Electro-Communications, 2011 (Supervisor: Tetsuo Kirimoto)

Background

Research Interests: text-to-speech synthesis, voice conversion, neural vocoder, machine learning, automatic speech recognition
Professional Field: Information Processing and Electronics Engineering
Profile: Research Engineer at NTT Corporation

Miscellany

Programming Skills: C, C++ (good at fast implementing using e.g. SIMD), C++/CLI, C#, Perl, Python (over 10 years), WPF, ASP.net, Javascript
Hobbies: Bonsai, Playing/watching baseball, Watching anime

Co-authors

6 total