Xinsheng Wang
Title
Cited by
Year
Opencpop: A high-quality open source Chinese popular song corpus for singing voice synthesis
Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi
Interspeech 2022, 2022
Cited by: 86; Year: 2022
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
Y Lei, S Yang, X Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022
Cited by: 69; Year: 2022
Experimental study on the relation between internal flow and flashing spray characteristics of R134a using straight tube nozzles
XS Wang, B Chen, R Wang, H Xin, ZF Zhou
International Journal of Heat and Mass Transfer 115, 524-536, 2017
Cited by: 42; Year: 2017
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022
Cited by: 35; Year: 2022
Generating images from spoken descriptions
X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 850-865, 2021
Cited by: 19; Year: 2021
Atomization and surface heat transfer characteristics of cryogen spray cooling with expansion-chambered nozzles
XS Wang, B Chen, ZF Zhou
International Journal of Heat and Mass Transfer 121, 15-27, 2018
Cited by: 18; Year: 2018
Numerical simulation of cryogen spray cooling by a three-dimensional hybrid vortex method
R Wang, B Chen, XS Wang
Applied Thermal Engineering 119, 319-330, 2017
Cited by: 18; Year: 2017
AnyoneNet: Synchronized speech and talking head generation for arbitrary persons
X Wang, Q Xie, J Zhu, L Xie, O Scharenborg
IEEE Transactions on Multimedia 25, 6717-6728, 2022
Cited by: 16; Year: 2022
S2IGAN: Speech-to-Image Generation via Adversarial Learning
X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg
Proc. Interspeech 2020, 2292-2296, 2020
Cited by: 15; Year: 2020
Visual space optimization for zero-shot learning
X Wang, S Pang, J Zhu, Z Li, Z Tian, Y Li
arXiv preprint arXiv:1907.00330, 2019
Cited by: 15; Year: 2019
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie
arXiv preprint arXiv:2207.01198, 2022
Cited by: 13; Year: 2022
Align or attend? Toward more efficient and accurate spoken word discovery using speech-to-image retrieval
L Wang, X Wang, M Hasegawa-Johnson, O Scharenborg, N Dehak
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2020
Cited by: 12; Year: 2020
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios
Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 11; Year: 2022
Learn2Sing 2.0: Diffusion and mutual information-based target speaker SVS by learning from singing teacher
H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi
arXiv preprint arXiv:2203.16408, 2022
Cited by: 9; Year: 2022
Controllable cross-speaker emotion transfer for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, L Xie
arXiv preprint arXiv:2109.06733, 2021
Cited by: 8; Year: 2021
Show and speak: Directly synthesize spoken description of images
X Wang, S Feng, J Zhu, M Hasegawa-Johnson, O Scharenborg
IEEE International Conference on Acoustics, Speech and Signal Processing, 2020
Cited by: 8; Year: 2020
Synthesizing spoken descriptions of images
X Wang, J Van Der Hout, J Zhu, M Hasegawa-Johnson, O Scharenborg
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3242-3254, 2021
Cited by: 7; Year: 2021
AdaVITS: Tiny VITS for low computing resource speaker adaptation
K Song, H Xue, X Wang, J Cong, Y Zhang, L Xie, B Yang, X Zhang, D Su
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 6; Year: 2022
Domain segmentation and adjustment for generalized zero-shot learning
X Wang, S Pang, J Zhu
arXiv preprint arXiv:2002.00226, 2020
Cited by: 6; Year: 2020
UniSyn: an end-to-end unified model for text-to-speech and singing voice synthesis
Y Lei, S Yang, X Wang, Q Xie, J Yao, L Xie, D Su
Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13025 …, 2023
Cited by: 5; Year: 2023