Xu Tan

Cited by

	All	Since 2019
Citations	10347	10241
h-index	44	44
i10-index	93	92

3800

1900

950

2850

201820192020202120222023202486 391 892 1558 2284 3778 1254

Public access

View all

24 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Tao QinSenior Principal Research Manager, Microsoft ResearchVerified email at microsoft.com
Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA FellowVerified email at microsoft.com
Sheng ZhaoMicrosoftVerified email at microsoft.com
Kaitao SongSenior Researcher, Microsoft ResearchVerified email at microsoft.com
Yi Ren (任意)Research Scientist, TiktokVerified email at bytedance.com
Zhou ZhaoZhejiang UniversityVerified email at zju.edu.cn
Yichong LengUniversity of Science and Technology of ChinaVerified email at mail.ustc.edu.cn
Jiang BianMicrosoft ResearchVerified email at microsoft.com
Rui WangMicrosoft Research AsiaVerified email at microsoft.com
Renqian LuoMicrosoft ResearchVerified email at microsoft.com
Junliang GuoMicrosoft ResearchVerified email at microsoft.com
Lei HePrincipal Scientist Manager, MicrosoftVerified email at microsoft.com
Tianyu HeMicrosoft ResearchVerified email at microsoft.com
Arul MenezesMicrosoft ResearchVerified email at microsoft.com
Hany Hassan AwadallaMicrosoft ResearchVerified email at microsoft.com
Ming Zhou (周明)Chief Scientist at Sinovation, ACL president (2019), VP of CCF(2020-2024)Verified email at chuangxin.com
Xuedong D. HuangMicrosoftVerified email at microsoft.com
Yuanchao ShuMicrosoft ResearchVerified email at microsoft.com
JIMING CHENProfessor at Zhejiang UniversityVerified email at ieee.org
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca

Xu Tan

Principal Researcher and Research Manager, Microsoft

Verified email at microsoft.com - Homepage

Large Language Models Speech Synthesis Music Generation Avatar Multimodality


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech Y Ren, C Hu, X Tan, T Qin, S Zhao, Z Zhao, TY Liu ICLR 2021, 2020	1159	2020
MASS: Masked Sequence to Sequence Pre-training for Language Generation K Song, X Tan, T Qin, J Lu, TY Liu ICML 2019, 2019	1084	2019
FastSpeech: Fast, Robust and Controllable Text to Speech Y Ren, Y Ruan, X Tan, T Qin, S Zhao, Z Zhao, TY Liu NIPS 2019, 2019	1024	2019
Mpnet: Masked and permuted pre-training for language understanding K Song, X Tan, T Qin, J Lu, TY Liu Advances in neural information processing systems 33, 16857-16867, 2020	761	2020
Achieving human parity on automatic chinese to english news translation H Hassan, A Aue, C Chen, V Chowdhary, J Clark, C Federmann, X Huang, ... arXiv preprint arXiv:1803.05567, 2018	670	2018
Hugginggpt: Solving ai tasks with chatgpt and its friends in hugging face Y Shen, K Song, X Tan, D Li, W Lu, Y Zhuang Advances in Neural Information Processing Systems 36, 2024	551	2024
A survey on neural speech synthesis X Tan, T Qin, F Soong, TY Liu arXiv preprint arXiv:2106.15561, 2021	323	2021
Multilingual Neural Machine Translation with Knowledge Distillation X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu ICLR 2019, 2019	245	2019
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ... ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020	216	2020
Representation Degeneration Problem in Training Natural Language Generation Models J Gao, D He, X Tan, T Qin, L Wang, T Liu ICLR 2019, 2018	208	2018
FRAGE: frequency-agnostic word representation C Gong, D He, X Tan, T Qin, L Wang, Liu, Tie-Yan NIPS 2018, 2018	171	2018
Adaspeech: Adaptive text to speech for custom voice M Chen, X Tan, B Li, Y Liu, T Qin, S Zhao, TY Liu ICLR 2021, 2021	139	2021
Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation T He, X Tan, Y Xia, D He, T Qin, Z Chen, Liu, Tie-Yan NIPS 2018, 2018	129	2018
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input J Guo, X Tan, D He, T Qin, L Xu, TY Liu AAAI 2019, 2018	127	2018
Almost Unsupervised Text to Speech and Automatic Speech Recognition Y Ren, X Tan, T Qin, S Zhao, Z Zhao, TY Liu ICML 2019, 2019	115	2019
Multilingual neural machine translation with language clustering X Tan, J Chen, D He, Y Xia, T Qin, TY Liu EMNLP 2019, 2019	110	2019
Popmag: Pop music accompaniment generation Y Ren, J He, X Tan, T Qin, Z Zhao, TY Liu Proceedings of the 28th ACM international conference on multimedia, 1198-1206, 2020	105	2020
Naturalspeech: End-to-end text-to-speech synthesis with human-level quality X Tan, J Chen, H Liu, J Cong, C Zhang, Y Liu, X Wang, Y Leng, Y Yi, L He, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	104	2024
Musicbert: Symbolic music understanding with large-scale pre-training M Zeng, X Tan, R Wang, Z Ju, T Qin, TY Liu ACL 2021, 2021	97	2021
Multispeech: Multi-speaker text to speech with transformer M Chen, X Tan, Y Ren, J Xu, H Sun, S Zhao, T Qin, TY Liu INTERSPEECH 2020, 2020	93	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors