Follow
Xu Tan
Xu Tan
Principal Researcher and Research Manager, Microsoft
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu
ICLR 2021, 2020
14322020
FastSpeech: Fast, Robust and Controllable Text to Speech
Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu
NIPS 2019, 2019
11782019
MASS: Masked Sequence to Sequence Pre-training for Language Generation
K Song, X Tan, T Qin, J Lu, TY Liu
ICML 2019, 2019
11532019
Mpnet: Masked and permuted pre-training for language understanding
K Song, X Tan, T Qin, J Lu, TY Liu
Advances in neural information processing systems 33, 16857-16867, 2020
10182020
Hugginggpt: Solving ai tasks with chatgpt and its friends in hugging face
Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang
Advances in Neural Information Processing Systems 36, 2024
8082024
Achieving human parity on automatic chinese to english news translation
H Hassan, A Aue, C Chen, V Chowdhary, J Clark, C Federmann, X Huang, ...
arXiv preprint arXiv:1803.05567, 2018
7202018
A survey on neural speech synthesis
X Tan, T Qin, F Soong, TY Liu
arXiv preprint arXiv:2106.15561, 2021
4082021
Multilingual Neural Machine Translation with Knowledge Distillation
X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu
ICLR 2019, 2019
2662019
Representation Degeneration Problem in Training Natural Language Generation Models
J Gao, D He, X Tan, T Qin, L Wang, T Liu
ICLR 2019, 2018
2542018
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit
T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ...
ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020
2352020
FRAGE: frequency-agnostic word representation
C Gong, D He, X Tan, T Qin, L Wang, Liu, Tie-Yan
NIPS 2018, 2018
1792018
Adaspeech: Adaptive text to speech for custom voice
M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao, TY Liu
ICLR 2021, 2021
1762021
Naturalspeech: End-to-end text-to-speech synthesis with human-level quality
X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
1672024
Naturalspeech 2: Latent diffusion models are natural and zero-shot speech and singing synthesizers
K Shen, Z Ju, X Tan, Y Liu, Y Leng, L He, T Qin, S Zhao, J Bian
arXiv preprint arXiv:2304.09116, 2023
1532023
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input
J Guo, X Tan, D He, T Qin, L Xu, TY Liu
AAAI 2019, 2018
1342018
Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation
T He, X Tan, Y Xia, D He, T Qin, Z Chen, Liu, Tie-Yan
NIPS 2018, 2018
1292018
Musicbert: Symbolic music understanding with large-scale pre-training
M Zeng, X Tan, R Wang, Z Ju, T Qin, TY Liu
ACL 2021, 2021
1252021
Popmag: Pop music accompaniment generation
Y Ren, J He, X Tan, T Qin, Z Zhao, TY Liu
Proceedings of the 28th ACM international conference on multimedia, 1198-1206, 2020
1242020
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu
ICML 2019, 2019
1242019
Multilingual neural machine translation with language clustering
X Tan, J Chen, D He, Y Xia, T Qin, TY Liu
EMNLP 2019, 2019
1172019
The system can't perform the operation now. Try again later.
Articles 1–20