Accent and speaker disentanglement in many-to-many voice conversion Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 35 | 2021 |
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022 | 34 | 2022 |
LM-VC: Zero-shot voice conversion via speech generation based on language models Z Wang, Y Chen, L Xie, Q Tian, Y Wang IEEE Signal Processing Letters, 2023 | 23 | 2023 |
Enriching source style transfer in recognition-synthesis based non-parallel voice conversion Z Wang, X Zhou, F Yang, T Li, H Du, L Xie, W Gan, H Chen, H Li arXiv preprint arXiv:2106.08741, 2021 | 21 | 2021 |
One-shot voice conversion for style transfer based on speaker adaptation Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 15 | 2022 |
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie arXiv preprint arXiv:2207.01198, 2022 | 13 | 2022 |
Expressive-VC: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 12 | 2023 |
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 11 | 2022 |
The NUS & NWPU system for Voice Conversion Challenge 2020 X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ... Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 10 | 2020 |
IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion W Gan, B Wen, Y Yan, H Chen, Z Wang, H Du, L Xie, K Guo, H Li arXiv preprint arXiv:2201.00269, 2022 | 8 | 2022 |
Streaming voice conversion via intermediate bottleneck features and non-streaming teacher guidance Y Chen, M Tu, T Li, X Li, Q Kong, J Li, Z Wang, Q Tian, Y Wang, Y Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
AccentSpeech: Learning accent from crowd-sourced data for target speaker TTS with accents Y Zhang, Z Wang, P Yang, H Sun, Z Wang, L Xie 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 7 | 2022 |
VITS-Based Singing Voice Conversion Leveraging Whisper and Multi-Scale F0 Modeling Z Ning, Y Jiang, Z Wang, B Zhang, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 4 | 2023 |
MSM-VC: high-fidelity source style transfer for non-parallel voice conversion by multi-scale style modeling Z Wang, X Wang, Q Xie, T Li, L Xie, Q Tian, Y Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 3 | 2023 |
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Z Wang, Y Chen, X Wang, Z Chen, L Xie, Y Wang, Y Wang arXiv preprint arXiv:2401.11053, 2024 | 2 | 2024 |
Delivering speaking style in low-resource voice conversion with multi-factor constraints Z Wang, X Wang, L Xie, Y Chen, Q Tian, Y Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Multi-level temporal-channel speaker retrieval for robust zero-shot voice conversion Z Wang, L Xue, Q Kong, L Xie, Y Chen, Q Tian, Y Wang arXiv preprint arXiv:2305.07204, 2023 | 2 | 2023 |
Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy L Ma, X Zhu, Y Lv, Z Wang, Z Wang, W He, H Zhou, L Xie arXiv preprint arXiv:2406.09844, 2024 | 1 | 2024 |
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi arXiv preprint arXiv:2406.07846, 2024 | 1 | 2024 |
U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning T Li, Z Wang, X Zhu, J Cong, Q Tian, Y Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | | 2024 |