Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi arXiv preprint arXiv:2201.07429, 2022 | 86 | 2022 |
Visinger: Variational inference with adversarial learning for end-to-end singing voice synthesis Y Zhang, J Cong, H Xue, L Xie, P Zhu, M Bi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 74 | 2022 |
Head motion synthesis from speech using deep neural networks C Ding, L Xie, P Zhu Multimedia Tools and Applications 74, 9871-9888, 2015 | 66 | 2015 |
Articulatory movement prediction using deep bidirectional long short-term memory based recurrent neural networks and word/phone embeddings. P Zhu, L Xie, Y Chen Interspeech, 2192-2196, 2015 | 41 | 2015 |
BLSTM neural networks for speech driven head motion synthesis. C Ding, P Zhu, L Xie Interspeech, 3345-3349, 2015 | 28 | 2015 |
Improving mandarin end-to-end speech synthesis by self-attention and learnable gaussian bias F Yang, S Yang, P Zhu, P Yan, L Xie 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019 | 17 | 2019 |
One-shot voice conversion for style transfer based on speaker adaptation Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 15 | 2022 |
Speech-driven head motion synthesis using neural networks C Ding, P Zhu, L Xie, D Jiang, ZH Fu Fifteenth Annual Conference of the International Speech Communication …, 2014 | 13 | 2014 |
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi arXiv preprint arXiv:2305.12425, 2023 | 10 | 2023 |
Learn2sing 2.0: Diffusion and mutual information-based target speaker svs by learning from singing teacher H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi arXiv preprint arXiv:2203.16408, 2022 | 9 | 2022 |
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Accent-VITS: accent transfer for end-to-end TTS L Ma, Y Zhang, X Zhu, Y Lei, Z Ning, P Zhu, L Xie National Conference on Man-Machine Speech Communication, 203-214, 2023 | 3 | 2023 |
Head motion generation for speech driven talking avatar B Li, L Xie, P Zhu, B Fan Journal of Tsinghua Univ (Sci & Tech) 53 (6), 898-902, 2013 | 3 | 2013 |
Speech recognition by selecting and refining hot words F Jin, W Liu, LJ Ma, PCPP Zhu, Y Qin, Q Shi, SL Zhang US Patent 10,607,601, 2020 | 2 | 2020 |
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi arXiv preprint arXiv:2406.07846, 2024 | 1 | 2024 |
E1 TTS: Simple and Fast Non-Autoregressive TTS Z Liu, S Wang, P Zhu, M Bi, H Li arXiv preprint arXiv:2409.09351, 2024 | | 2024 |
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion S Inoue, S Wang, W Wang, P Zhu, M Bi, H Li arXiv preprint arXiv:2409.09352, 2024 | | 2024 |
Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models H Xue, S Guo, P Zhu, M Bi arXiv preprint arXiv:2308.10428, 2023 | | 2023 |
Text-to-articulatory movement W Liu, Q Shi, SL Zhang, PC Zhu US Patent 10,521,945, 2019 | | 2019 |