Zhichao Wang
Verified email at mail.nwpu.edu.cn
Title
Cited by
Year
Accent and speaker disentanglement in many-to-many voice conversion
Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
Cited by: 35; Year: 2021
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022
Cited by: 34; Year: 2022
LM-VC: Zero-shot voice conversion via speech generation based on language models
Z Wang, Y Chen, L Xie, Q Tian, Y Wang
IEEE Signal Processing Letters, 2023
Cited by: 23; Year: 2023
Enriching source style transfer in recognition-synthesis based non-parallel voice conversion
Z Wang, X Zhou, F Yang, T Li, H Du, L Xie, W Gan, H Chen, H Li
arXiv preprint arXiv:2106.08741, 2021
Cited by: 21; Year: 2021
One-shot voice conversion for style transfer based on speaker adaptation
Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 15; Year: 2022
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie
arXiv preprint arXiv:2207.01198, 2022
Cited by: 13; Year: 2022
Expressive-VC: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 12; Year: 2023
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios
Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 11; Year: 2022
The NUS & NWPU system for Voice Conversion Challenge 2020
X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ...
Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020
Cited by: 10; Year: 2020
IQDubbing: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion
W Gan, B Wen, Y Yan, H Chen, Z Wang, H Du, L Xie, K Guo, H Li
arXiv preprint arXiv:2201.00269, 2022
Cited by: 8; Year: 2022
Streaming voice conversion via intermediate bottleneck features and non-streaming teacher guidance
Y Chen, M Tu, T Li, X Li, Q Kong, J Li, Z Wang, Q Tian, Y Wang, Y Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 7; Year: 2023
AccentSpeech: Learning accent from crowd-sourced data for target speaker TTS with accents
Y Zhang, Z Wang, P Yang, H Sun, Z Wang, L Xie
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 7; Year: 2022
VITS-Based Singing Voice Conversion Leveraging Whisper and Multi-Scale F0 Modeling
Z Ning, Y Jiang, Z Wang, B Zhang, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Cited by: 4; Year: 2023
MSM-VC: high-fidelity source style transfer for non-parallel voice conversion by multi-scale style modeling
Z Wang, X Wang, Q Xie, T Li, L Xie, Q Tian, Y Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
Cited by: 3; Year: 2023
StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion
Z Wang, Y Chen, X Wang, Z Chen, L Xie, Y Wang, Y Wang
arXiv preprint arXiv:2401.11053, 2024
Cited by: 2; Year: 2024
Delivering speaking style in low-resource voice conversion with multi-factor constraints
Z Wang, X Wang, L Xie, Y Chen, Q Tian, Y Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 2; Year: 2023
Multi-level temporal-channel speaker retrieval for robust zero-shot voice conversion
Z Wang, L Xue, Q Kong, L Xie, Y Chen, Q Tian, Y Wang
arXiv preprint arXiv:2305.07204, 2023
Cited by: 2; Year: 2023
Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy
L Ma, X Zhu, Y Lv, Z Wang, Z Wang, W He, H Zhou, L Xie
arXiv preprint arXiv:2406.09844, 2024
Cited by: 1; Year: 2024
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion
Z Ning, S Wang, P Zhu, Z Wang, J Yao, L Xie, M Bi
arXiv preprint arXiv:2406.07846, 2024
Cited by: 1; Year: 2024
U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning
T Li, Z Wang, X Zhu, J Cong, Q Tian, Y Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Year: 2024