Takaaki Saeki

Cited by

	All	Since 2019
Citations	286	286
h-index	8	8
i10-index	7	7

120

202020212022202320241 11 40 115 118

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shinnosuke TakamichiKeio UniversityVerified email at keio.jp
Hiroshi SaruwatariProfessor, The University of TokyoVerified email at ipc.i.u-tokyo.ac.jp
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Soumi MaitiCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Detai XinThe University of TokyoVerified email at ipc.i.u-tokyo.ac.jp
Yuki SaitoLecturer, The University of TokyoVerified email at ipc.i.u-tokyo.ac.jp
Ryuichi YamamotoLY CorporationVerified email at lycorp.co.jp
Wataru NakataThe University of TokyoVerified email at g.ecc.u-tokyo.ac.jp
Sayaka ShiotaTokyo Metropolitan UniversityVerified email at tmu.ac.jp
Xinjian LiGoogleVerified email at google.com
Jiatong Shi (史嘉彤)Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Yusuke YasdaNagoya universityVerified email at g.sp.m.is.nagoya-u.ac.jp
Tomoki HayashiHuman Dataware Lab. Co., Ltd., Nagoya UniversityVerified email at g.sp.m.is.nagoya-u.ac.jp
Yooncheol JuSpeech synthesis AI researcher, 42dot.Inc, Hyundai Motor GroupVerified email at 42dot.ai
Peter WuSchool of Computer Science, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Takenori YoshimuraNagoya Institute of TechnologyVerified email at nitech.ac.jp
Bhuvana RamabhadranManager, GoogleVerified email at google.com
Andrew RosenbergGoogleVerified email at google.com
Yuta Matsunaga東京大学Verified email at g.ecc.u-tokyo.ac.jp
Kentaro SekiThe University of TokyoVerified email at g.ecc.u-tokyo.ac.jp

Takaaki Saeki

Google

Verified email at google.com - Homepage

Speech Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022 T Saeki, D Xin, W Nakata, T Koriyama, S Takamichi, H Saruwatari arXiv preprint arXiv:2204.02152, 2022	78	2022
Espnet2-tts: Extending the edge of tts research T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021	51	2021
JTubeSpeech: corpus of Japanese speech collected from YouTube for speech recognition and speaker verification S Takamichi, L Kürzinger, T Saeki, S Shiota, S Watanabe arXiv preprint arXiv:2112.09323, 2021	20	2021
Incremental text-to-speech synthesis using pseudo lookahead with large pretrained language model T Saeki, S Takamichi, H Saruwatari IEEE Signal Processing Letters 28, 857-861, 2021	19	2021
SpeechLMScore: Evaluating speech generation using speech language model S Maiti, Y Peng, T Saeki, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	18	2023
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU. T Saeki, Y Saito, S Takamichi, H Saruwatari INTERSPEECH, 1021-1022, 2020	14	2020
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning T Saeki, K Tachibana, R Yamamoto arXiv preprint arXiv:2203.15683, 2022	10	2022
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech T Saeki, H Zen, Z Chen, N Morioka, G Wang, Y Zhang, A Bapna, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	9	2023
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech D Yang, T Koriyama, Y Saito, T Saeki, D Xin, H Saruwatari ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	8	2023
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari arXiv preprint arXiv:2301.12596, 2023	8	2023
End-to-End Deep Learning Speech Recognition Model for Silent Speech Challenge. N Kimura, Z Su, T Saeki INTERSPEECH, 1025-1026, 2020	8	2020
Text-to-speech synthesis from dark data with evaluation-in-the-loop data selection K Seki, S Takamichi, T Saeki, H Saruwatari ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	7	2023
SelfRemaster: Self-Supervised Speech Restoration with Analysis-by-Synthesis Approach Using Channel Modeling T Saeki, S Takamichi, T Nakamura, N Tanji, H Saruwatari arXiv preprint arXiv:2203.12937, 2022	6	2022
Yodas: Youtube-Oriented Dataset for Audio and Speech X Li, S Takamichi, T Saeki, W Chen, S Shiota, S Watanabe 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	4	2023
Lifter training and sub-band modeling for computationally efficient and high-quality voice conversion using spectral differentials T Saeki, Y Saito, S Takamichi, H Saruwatari ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	4	2020
Diversity-based core-set selection for text-to-speech with linguistic and acoustic features K Seki, S Takamichi, T Saeki, H Saruwatari ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3	2024
Personalized filled-pause generation with group-wise prediction models Y Matsunaga, T Saeki, S Takamichi, H Saruwatari arXiv preprint arXiv:2203.09961, 2022	3	2022
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network T Saeki, S Takamichi, H Saruwatari 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	3	2021
Real-Time Full-Band Voice Conversion with Sub-Band Modeling and Data-Driven Phase Estimation of Spectral Differentials T Saeki, Y Saito, S Takamichi, H Saruwatari IEICE TRANSACTIONS on Information and Systems 104 (7), 1002-1016, 2021	3	2021
vTTS: visual-text to speech Y Nakano, T Saeki, S Takamichi, K Sudoh, H Saruwatari 2022 IEEE Spoken Language Technology Workshop (SLT), 936-942, 2023	2	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors