Follow
yi lei
yi lei
Unknown affiliation
Verified email at nwpu.edu.cn
Title
Cited by
Cited by
Year
Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
Y Lei, S Yang, X Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022
682022
Fine-grained emotion strength transfer, control and prediction for emotional speech synthesis
Y Lei, S Yang, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 423-430, 2021
622021
Promptstyle: Controllable style transfer for text-to-speech with natural language descriptions
G Liu, Y Zhang, Y Lei, Y Chen, R Wang, Z Li, L Xie
arXiv preprint arXiv:2305.19522, 2023
332023
Glow-wavegan 2: High-quality zero-shot text-to-speech synthesis and any-to-any voice conversion
Y Lei, S Yang, J Cong, L Xie, D Su
arXiv preprint arXiv:2207.01832, 2022
162022
Promptvc: Flexible stylistic voice conversion in latent space driven by natural language prompts
J Yao, Y Yang, Y Lei, Z Ning, Y Hu, Y Pan, J Yin, H Zhou, H Lu, L Xie
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
142024
Multi-speaker expressive speech synthesis via multiple factors decoupling
X Zhu, Y Lei, K Song, Y Zhang, T Li, L Xie
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
132023
Cross-speaker emotion transfer through information perturbation in emotional speech synthesis
Y Lei, S Yang, X Zhu, L Xie, D Su
IEEE Signal Processing Letters 29, 1948-1952, 2022
132022
Promptspeaker: Speaker generation based on text descriptions
Y Zhang, G Liu, Y Lei, Y Chen, H Yin, L Xie, Z Li
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
122023
Dspgan: a gan-based universal vocoder for high-fidelity tts by time-frequency domain supervision from dsp
K Song, Y Zhang, Y Lei, J Cong, H Li, L Xie, G He, J Bai
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
112023
Distinguishable speaker anonymization based on formant and fundamental frequency scaling
J Yao, Q Wang, Y Lei, P Guo, L Xie, N Wang, J Liu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
Learn2sing: Target speaker singing voice synthesis by learning from a singing teacher
H Xue, S Yang, Y Lei, L Xie, X Li
2021 IEEE Spoken Language Technology Workshop (SLT), 522-529, 2021
102021
Metts: Multilingual emotional text-to-speech by cross-speaker and cross-lingual emotion transfer
X Zhu, Y Lei, T Li, Y Zhang, H Zhou, H Lu, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
92024
Vec-tok speech: Speech vectorization and tokenization for neural speech generation
X Zhu, Y Lv, Y Lei, T Li, W He, H Zhou, H Lu, L Xie
arXiv preprint arXiv:2310.07246, 2023
72023
Preserving background sound in noise-robust voice conversion via multi-task learning
J Yao, Y Lei, Q Wang, P Guo, Z Ning, L Xie, H Li, J Liu, D Xie
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
UniSyn: an end-to-end unified model for text-to-speech and singing voice synthesis
Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su
Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023
52023
Styles2st: Zero-shot style transfer for direct speech-to-speech translation
K Song, Y Ren, Y Lei, C Wang, K Wei, L Xie, X Yin, Z Ma
arXiv preprint arXiv:2305.17732, 2023
52023
End-to-end voice conversion with information perturbation
Q Xie, S Yang, Y Lei, L Xie, D Su
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
52022
VITS-based Singing Voice Conversion System with DSPGAN post-processing for SVCC2023
Y Zhou, M Chen, Y Lei, J Zhu, W Zhao
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
42023
Accent-VITS: accent transfer for end-to-end TTS
L Ma, Y Zhang, X Zhu, Y Lei, Z Ning, P Zhu, L Xie
National Conference on Man-Machine Speech Communication, 203-214, 2023
32023
Zero-Shot Emotion Transfer for Cross-Lingual Speech Synthesis
Y Li, X Zhu, Y Lei, H Li, J Liu, D Xie, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20