Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 3246 | 2018 |
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018 | 973 | 2018 |
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018 | 276 | 2018 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 209 | 2019 |
Parallel tacotron: Non-autoregressive and controllable tts I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 126 | 2021 |
SATzilla2012: Improved algorithm selection based on cost-sensitive classification models L Xu, F Hutter, J Shen, HH Hoos, K Leyton-Brown Proceedings of SAT Challenge, 57-58, 2012 | 126 | 2012 |
Non-attentive tacotron: Robust and controllable neural tts synthesis including unsupervised duration modeling J Shen, Y Jia, M Chrzanowski, Y Zhang, I Elias, H Zen, Y Wu arXiv preprint arXiv:2010.04301, 2020 | 99 | 2020 |
PnG BERT: Augmented BERT on phonemes and graphemes for neural TTS Y Jia, H Zen, J Shen, Y Zhang, Y Wu arXiv preprint arXiv:2103.15060, 2021 | 85 | 2021 |
Neural program synthesis with priority queue training DA Abolafia, M Norouzi, J Shen, R Zhao, QV Le arXiv preprint arXiv:1801.03526, 2018 | 69 | 2018 |
Parallel Tacotron 2: A non-autoregressive neural TTS model with differentiable duration modeling I Elias, H Zen, J Shen, Y Zhang, Y Jia, RJ Skerry-Ryan, Y Wu arXiv preprint arXiv:2103.14574, 2021 | 68 | 2021 |
Synthesizing speech from text using neural networks Y Wu, J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, ... US Patent 10,971,170, 2021 | 59 | 2021 |
In teacher we trust: Learning compressed models for pedestrian detection J Shen, N Vesdapunt, VN Boddeti, KM Kitani arXiv preprint arXiv:1612.00478, 2016 | 39 | 2016 |
Examining scaling and transfer of language model architectures for machine translation B Zhang, B Ghorbani, A Bapna, Y Cheng, X Garcia, J Shen, O Firat International Conference on Machine Learning, 26176-26192, 2022 | 12 | 2022 |
Synthesis of speech from text in a voice of a target speaker using neural networks Y Jia, Z Chen, Y Wu, J Shen, R Pang, RJ Weiss, IL Moreno, F Ren, ... US Patent 11,488,575, 2022 | 7 | 2022 |
Training text-to-speech systems from synthetic data: A practical approach for accent transfer tasks L Finkelstein, H Zen, N Casagrande, C Chan, Y Jia, T Kenter, A Petelin, ... arXiv preprint arXiv:2208.13183, 2022 | 5 | 2022 |
Parallel tacotron non-autoregressive and controllable TTS I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun US Patent 11,908,448, 2024 | 3 | 2024 |
Modelling intonation in spectrograms for neural vocoder based text-to-speech V Wan, J Shen, H Silen, R Clark Speech Prosody 2020, 2020 | 2 | 2020 |
Text-to-speech using duration prediction Y Zhang, I Elias, B Chun, Y Jia, Y Wu, M Chrzanowski, J Shen US Patent 12,100,382, 2024 | 1 | 2024 |
Phonemes and graphemes for neural text-to-speech Y Jia, B Chun, Y Zhang, J Shen, Y Wu US Patent 12,020,685, 2024 | | 2024 |
Parallel Tacotron Non-Autoregressive and Controllable TTS I Elias, J Shen, Y Zhang, Y Jia, RJ Weiss, Y Wu, B Chun US Patent App. 18/421,116, 2024 | | 2024 |