Follow
Takaaki Saeki
Takaaki Saeki
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari
arXiv preprint arXiv:2204.02152, 2022
1492022
Espnet2-tts: Extending the edge of tts research
T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ...
arXiv preprint arXiv:2110.07840, 2021
622021
SpeechLMScore: Evaluating speech generation using speech language model
S Maiti, Y Peng, T Saeki, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
302023
JTubeSpeech: corpus of Japanese speech collected from YouTube for speech recognition and speaker verification
S Takamichi, L Kürzinger, T Saeki, S Shiota, S Watanabe
arXiv preprint arXiv:2112.09323, 2021
232021
Incremental text-to-speech synthesis using pseudo lookahead with large pretrained language model
T Saeki, S Takamichi, H Saruwatari
IEEE Signal Processing Letters 28, 857-861, 2021
222021
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU.
T Saeki, Y Saito, S Takamichi, H Saruwatari
INTERSPEECH, 1021-1022, 2020
162020
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech
T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
142023
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech
D Yang, T Koriyama, Y Saito, T Saeki, D Xin, H Saruwatari
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
132023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari
arXiv preprint arXiv:2301.12596, 2023
122023
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
T Saeki, K Tachibana, R Yamamoto
arXiv preprint arXiv:2203.15683, 2022
122022
Text-to-speech synthesis from dark data with evaluation-in-the-loop data selection
K Seki, S Takamichi, T Saeki, H Saruwatari
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data
T Saeki, G Wang, N Morioka, I Elias, K Kastner, A Rosenberg, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
92024
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
T Saeki, S Maiti, S Takamichi, S Watanabe, H Saruwatari
arXiv preprint arXiv:2401.16812, 2024
92024
End-to-End Deep Learning Speech Recognition Model for Silent Speech Challenge.
N Kimura, Z Su, T Saeki
INTERSPEECH, 1025-1026, 2020
82020
Yodas: Youtube-Oriented Dataset for Audio and Speech
X Li, S Takamichi, T Saeki, W Chen, S Shiota, S Watanabe
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
72023
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling
T Saeki, S Takamichi, T Nakamura, N Tanji, H Saruwatari
arXiv preprint arXiv:2203.12937, 2022
62022
Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang, Ankur Bapna, Andrew Rosenberg, and Bhuvana Ramabhadran. Virtuoso: Massive multilingual speech-text joint semi …
T Saeki
ICASSP 2023, 1-5, 2023
52023
Diversity-based core-set selection for text-to-speech with linguistic and acoustic features
K Seki, S Takamichi, T Saeki, H Saruwatari
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network
T Saeki, S Takamichi, H Saruwatari
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
42021
Real-Time Full-Band Voice Conversion with Sub-Band Modeling and Data-Driven Phase Estimation of Spectral Differentials
T Saeki, Y Saito, S Takamichi, H Saruwatari
IEICE TRANSACTIONS on Information and Systems 104 (7), 1002-1016, 2021
42021
The system can't perform the operation now. Try again later.
Articles 1–20