Stargan-vc: Non-parallel many-to-many voice conversion using star generative adversarial networks H Kameoka, T Kaneko, K Tanaka, N Hojo 2018 IEEE Spoken Language Technology Workshop (SLT), 266-273, 2018 | 498 | 2018 |
Cyclegan-vc2: Improved cyclegan-based non-parallel voice conversion T Kaneko, H Kameoka, K Tanaka, N Hojo ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 345 | 2019 |
ACVAE-VC: Non-parallel voice conversion with auxiliary classifier variational autoencoder H Kameoka, T Kaneko, K Tanaka, N Hojo IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (9), 1432 …, 2019 | 183 | 2019 |
Stargan-vc2: Rethinking conditional methods for stargan-based voice conversion T Kaneko, H Kameoka, K Tanaka, N Hojo arXiv preprint arXiv:1907.12279, 2019 | 180 | 2019 |
Generative adversarial network-based postfilter for statistical parametric speech synthesis T Kaneko, H Kameoka, N Hojo, Y Ijima, K Hiramatsu, K Kashino 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 166 | 2017 |
AttS2S-VC: Sequence-to-sequence voice conversion with attention and context preservation mechanisms K Tanaka, H Kameoka, T Kaneko, N Hojo ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 127 | 2019 |
Cyclegan-vc3: Examining and improving cyclegan-vcs for mel-spectrogram conversion T Kaneko, H Kameoka, K Tanaka, N Hojo arXiv preprint arXiv:2010.11672, 2020 | 108 | 2020 |
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion H Kameoka, K Tanaka, D Kwaśny, T Kaneko, N Hojo IEEE/ACM Transactions on audio, speech, and language processing 28, 1849-1863, 2020 | 79 | 2020 |
Maskcyclegan-vc: Learning non-parallel voice conversion with filling in frames T Kaneko, H Kameoka, K Tanaka, N Hojo ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 72 | 2021 |
An Investigation of DNN-Based Speech Synthesis Using Speaker Codes. N Hojo, Y Ijima, H Mizuno INTERSPEECH, 2278-2282, 2016 | 63 | 2016 |
DNN-based speech synthesis using speaker codes N Hojo, Y Ijima, H Mizuno IEICE TRANSACTIONS on Information and Systems 101 (2), 462-472, 2018 | 55 | 2018 |
Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks K Tanaka, T Kaneko, N Hojo, H Kameoka 2018 IEEE Spoken Language Technology Workshop (SLT), 632-639, 2018 | 49 | 2018 |
Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram K Oyamada, H Kameoka, T Kaneko, K Tanaka, N Hojo, H Ando 2018 26th European Signal Processing Conference (EUSIPCO), 2514-2518, 2018 | 45 | 2018 |
Many-to-many voice transformer network H Kameoka, WC Huang, K Tanaka, T Kaneko, N Hojo, T Toda IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 656-670, 2020 | 38 | 2020 |
An investigation to transplant emotional expressions in DNN-based TTS synthesis K Inoue, S Hara, M Abe, N Hojo, Y Ijima 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017 | 36 | 2017 |
Model architectures to extrapolate emotional expressions in DNN-based text-to-speech K Inoue, S Hara, M Abe, N Hojo, Y Ijima Speech Communication 126, 35-43, 2021 | 28 | 2021 |
Nonparallel voice conversion with augmented classifier star generative adversarial networks H Kameoka, T Kaneko, K Tanaka, N Hojo IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2982-2995, 2020 | 26 | 2020 |
WaveCycleGAN2: Time-domain neural post-filter for speech waveform generation K Tanaka, H Kameoka, T Kaneko, N Hojo arXiv preprint arXiv:1904.02892, 2019 | 26 | 2019 |
Voicegrad: Non-parallel any-to-many voice conversion with annealed langevin dynamics H Kameoka, T Kaneko, K Tanaka, N Hojo, S Seki arXiv preprint arXiv:2010.02977, 2020 | 16 | 2020 |
WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks K Tanaka, T Kaneko, N Hojo, H Kameoka arXiv preprint arXiv:1809.10288, 2018 | 10 | 2018 |