Paper 'Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control' submitted to ICASSP 2025
Paper 'LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning' accepted to Interspeech 2024
Paper 'CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection' accepted to Interspeech 2024
Paper 'Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment' accepted to Interspeech 2024
Paper 'SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark' accepted to Interspeech 2024
Background
A software engineer and researcher at LY Corporation, based in Nagoya, Japan, and a Ph.D. student at Nagoya University. Research interests include statistical speech synthesis, voice conversion, singing voice synthesis, and machine learning.