Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi Interspeech 2022, 2022 | 86 | 2022 |
Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis Y Lei, S Yang, X Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022 | 69 | 2022 |
Experimental study on the relation between internal flow and flashing spray characteristics of R134a using straight tube nozzles XS Wang, B Chen, R Wang, H Xin, ZF Zhou International Journal of Heat and Mass Transfer 115, 524-536, 2017 | 42 | 2017 |
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022 | 35 | 2022 |
Generating images from spoken descriptions X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 850-865, 2021 | 19 | 2021 |
Atomization and surface heat transfer characteristics of cryogen spray cooling with expansion-chambered nozzles XS Wang, B Chen, ZF Zhou International Journal of Heat and Mass Transfer 121, 15-27, 2018 | 18 | 2018 |
Numerical simulation of cryogen spray cooling by a three-dimensional hybrid vortex method R Wang, B Chen, XS Wang Applied Thermal Engineering 119, 319-330, 2017 | 18 | 2017 |
Anyonenet: Synchronized speech and talking head generation for arbitrary persons X Wang, Q Xie, J Zhu, L Xie, O Scharenborg IEEE Transactions on Multimedia 25, 6717-6728, 2022 | 16 | 2022 |
S2IGAN: Speech-to-Image Generation via Adversarial Learning X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg Proc. Interspeech 2020, 2292--2296, 2020 | 15 | 2020 |
Visual space optimization for zero-shot learning X Wang, S Pang, J Zhu, Z Li, Z Tian, Y Li arXiv preprint arXiv:1907.00330, 2019 | 15 | 2019 |
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie arXiv preprint arXiv:2207.01198, 2022 | 13 | 2022 |
ALIGN OR ATTEND? TOWARD MORE EFFICIENT AND ACCURATE SPOKEN WORD DISCOVERY USING SPEECH-TO-IMAGE RETRIEVAL L Wang, X Wang, M Hasegawa-Johnson, O Scharenborg, N Dehak IEEE International Conference on Acoustics, Speech and Signal Processing …, 2020 | 12 | 2020 |
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 11 | 2022 |
Learn2sing 2.0: Diffusion and mutual information-based target speaker svs by learning from singing teacher H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi arXiv preprint arXiv:2203.16408, 2022 | 9 | 2022 |
Controllable crossspeaker emotion transfer for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, L Xie arXiv preprint arXiv:2109.06733, 2021 | 8 | 2021 |
Show and speak: Directly synthesize spoken description of images X Wang, S Feng, J Zhu, M Hasegawa-Johnson, O Scharenborg IEEE International Conference on Acoustics, Speech and Signal Processing, 2020 | 8 | 2020 |
Synthesizing spoken descriptions of images X Wang, J Van Der Hout, J Zhu, M Hasegawa-Johnson, O Scharenborg IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3242-3254, 2021 | 7 | 2021 |
AdaVITS: Tiny VITS for low computing resource speaker adaptation K Song, H Xue, X Wang, J Cong, Y Zhang, L Xie, B Yang, X Zhang, D Su 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 6 | 2022 |
Domain segmentation and adjustment for generalized zero-shot learning X Wang, S Pang, J Zhu arXiv preprint arXiv:2002.00226, 2020 | 6 | 2020 |
UniSyn: an end-to-end unified model for text-to-speech and singing voice synthesis Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023 | 5 | 2023 |