Lightweight LPCNet-Based Neural Vocoder with Tensor Decomposition H Kanagawa, Y Ijima Interspeech, 205-209, 2020 | 17 | 2020 |
Speaker-independent style conversion for HMM-based expressive speech synthesis H Kanagawa, T Nose, T Kobayashi 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 13 | 2013 |
Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters K Fujita, H Sato, T Ashihara, H Kanagawa, M Delcroix, T Moriya, Y Ijima ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 10 | 2024 |
Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model K Fujita, T Ashihara, H Kanagawa, T Moriya, Y Ijima 2023 IEEE International Conference on Acoustics, Speech, and Signal …, 2023 | 9 | 2023 |
Feature-space structural MAPLR with regression tree-based multiple transformation matrices for DNN H Kanagawa, Y Tachioka, S Watanabe, J Ishii 2015 Asia-Pacific Signal and Information Processing Association Annual …, 2015 | 7 | 2015 |
Multi-sample subband WaveRNN via multivariate Gaussian H Kanagawa, Y Ijima ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 5 | 2022 |
Efficient building strategy with knowledge distillation for small-footprint acoustic models T Moriya, H Kanagawa, K Matsui, T Fukutomi, Y Shinohara, Y Yamaguchi, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 21-28, 2018 | 4 | 2018 |
Acoustic model learning device, voice synthesis device, and program H Kanagawa, Y Ijima US Patent 11,545,135, 2023 | 2 | 2023 |
A Study on Speaker Independent Style Conversion in HMM Speech Synthesis H Kanagawa, T Nose, T Kobayashi IEICE Technical Report; IEICE Tech. Rep. 111 (365), 191-196, 2011 | 2 | 2011 |
Generating method, generating device, and generating program H Kanagawa US Patent App. 18/038,702, 2024 | 1 | 2024 |
VC-T: Streaming Voice Conversion Based on Neural Transducer H Kanagawa, T Moriya, Y Ijima INTERSPEECH, 2088-2092, 2023 | 1 | 2023 |
Joint Modeling of Multi-Sample and Subband Signals for Fast Neural Vocoding on CPU H Kanagawa, Y Ijima, H Toda INTERSPEECH, 1626-1630, 2022 | 1 | 2022 |
Multi-Speaker Modeling for DNN-based Speech Synthesis Incorporating Generative Adversarial Networks H Kanagawa, Y Ijima Proc. 10th ISCA Speech Synthesis Workshop, 40-44, 2019 | 1 | 2019 |
A Study on Speaker-Independent Style Conversion in HMM Speech Synthesis H Kanagawa, T Nose, T Kobayashi Research Report, Spoken Language Processing (SLP) 2011 (32), 1-6, 2011 | 1 | 2011 |
Generating method, generating program, and generating device H Kanagawa US Patent App. 18/576,322, 2024 | | 2024 |
Labeling method, labeling device, and labeling program H Kanagawa US Patent App. 18/038,700, 2024 | | 2024 |
Knowledge Distillation from Self-Supervised Representation Learning Model with Discrete Speech Units for Any-to-Any Streaming Voice Conversion H Kanagawa, Y Ijima Proc. Interspeech 2024, 4393-4397, 2024 | | 2024 |
Pre-training Neural Transducer-based Streaming Voice Conversion for Faster Convergence and Alignment-free Training H Kanagawa, T Moriya, Y Ijima Proc. Interspeech 2024, 2755-2759, 2024 | | 2024 |
Enhancement of Text-Predicting Style Token With Generative Adversarial Network for Expressive Speech Synthesis H Kanagawa, Y Ijima ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
Expressive Text-to-Speech Synthesis using Text Chat Dataset with Speaking Style Information Y Homma, H Kanagawa, N Kobayashi, Y Ijima, K Saito Transactions of the Japanese Society for Artificial Intelligence 38 (3), F-MA7, 2023 | | 2023 |