Follow
Shan Yang
Shan Yang
Tencent AI Lab
Verified email at nwpu-aslp.org
Title
Cited by
Cited by
Year
Multi-band melgan: Faster waveform generation for high-quality text-to-speech
G Yang, S Yang, K Liu, P Fang, W Chen, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 492-498, 2021
2432021
Controllable emotion transfer for end-to-end speech synthesis
T Li, S Yang, L Xue, L Xie
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
852021
A deep bidirectional LSTM approach for video-realistic talking head
B Fan, L Xie, S Yang, L Wang, FK Soong
Multimedia Tools and Applications 75, 5287-5309, 2016
682016
Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
Y Lei, S Yang, X Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022
672022
Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework
S Yang, L Xie, X Chen, X Lou, X Zhu, D Huang, H Li
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
652017
Fine-grained emotion strength transfer, control and prediction for emotional speech synthesis
Y Lei, S Yang, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 423-430, 2021
622021
Controlling emotion strength with relative attribute for end-to-end speech synthesis
Z Xiaolian, Y Shan, X Geng, Yang, Lei
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
542019
Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis
X Zhu, Y Zhang, S Yang, L Xue, L Xie
IEEE Access 7, 65955-65964, 2019
392019
On the localness modeling for the self-attention based end-to-end speech synthesis
S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu
Neural Networks 125, 121-130, 2020
362020
Accent and speaker disentanglement in many-to-many voice conversion
Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
352021
Controllable context-aware conversational speech synthesis
J Cong, S Yang, N Hu, G Li, L Xie, D Su
Interspeech, 2021, 4658-4662, 2021
332021
Data efficient voice cloning from noisy samples with domain adversarial training
J Cong, S Yang, L Xie, G Yu, G Wan
arXiv preprint arXiv:2008.04265, 2020
332020
Glow-wavegan: Learning speech representations from gan-based variational auto-encoder for high fidelity flow-based speech synthesis
J Cong, S Yang, L Xie, D Su
Interspeech, 2021, 2021
292021
Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis
X An, Y Wang, S Yang, Z Ma, L Xie
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
222019
Enhancing Hybrid Self-attention Structure with Relative-position-aware Bias for Speech Synthesis
S Yang, H Lu, S Kang, L Xie, D Yu
2019 IEEE International Conference on Acoustics, Speech and Signal …, 2019
192019
Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias
F Yang, S Yang, P Zhu, P Yan, L Xie
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
172019
On the training of dnn-based average voice model for speech synthesis
S Yang, Z Wu, L Xie
2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016
172016
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion
Y Lei, S Yang, J Cong, L Xie, D Su
Interspeech, 2022, 2022
162022
The USTC system for blizzard challenge 2017
YJ Hu, C Ding, LJ Liu, ZH Ling, LR Dai
Proc. Blizzard Challenge Workshop, 2017
162017
Clinical genetics testing laboratories have a remarkably low rate of clinically significant discordance when interpreting variants in hereditary cancer syndrome genes
RL Nussbaum, S Yang, SE Lincoln
J Clin Oncol 35 (11), 1259-1261, 2017
142017
The system can't perform the operation now. Try again later.
Articles 1–20