Chenglin Xu
Kuaishou Technology, China
Verified email at kuaishou.com
Title
Cited by
Year
Spex: Multi-scale time domain speaker extraction network
C Xu, W Rao, ES Chng, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1370-1384, 2020
Cited by: 170, Year: 2020
Spex+: A complete time domain speaker extraction network
M Ge, C Xu, L Wang, ES Chng, J Dang, H Li
arXiv preprint arXiv:2005.04686, 2020
Cited by: 143, Year: 2020
Progressive tandem learning for pattern recognition with deep spiking neural networks
J Wu, C Xu, X Han, D Zhou, M Zhang, H Li, KC Tan
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (11), 7824 …, 2021
Cited by: 124, Year: 2021
Single channel speech separation with constrained utterance level permutation invariant training using grid LSTM
C Xu, W Rao, X Xiao, ES Chng, H Li
Cited by: 86, Year: 2018
Optimization of speaker extraction neural network with magnitude and temporal spectrum approximation loss
C Xu, W Rao, ES Chng, H Li
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 63, Year: 2019
Time-Domain Speaker Extraction Network
C Xu, W Rao, ES Chng, H Li
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 59, Year: 2019
A deep neural network approach for sentence boundary detection in broadcast news.
C Xu, L Xie, G Huang, X Xiao, E Chng, H Li
INTERSPEECH, 2887-2891, 2014
Cited by: 51, Year: 2014
A study of learning based beamforming methods for speech recognition
X Xiao, C Xu, Z Zhang, S Zhao, S Sun, S Watanabe, L Wang, L Xie, ...
CHiME 2016 workshop, 26-31, 2016
Cited by: 49, Year: 2016
Multi-stage speaker extraction with utterance and frame-level reference signals
M Ge, C Xu, L Wang, ES Chng, J Dang, H Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 46, Year: 2021
Muse: Multi-modal target speaker extraction with visual cues
Z Pan, R Tao, C Xu, H Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 46, Year: 2021
Target speaker extraction for overlapped multi-talker speaker verification
W Rao, C Xu, ES Chng, H Li
arXiv preprint arXiv:1902.02546, 2019
Cited by: 44, Year: 2019
Selective listening by synchronizing speech with lips
Z Pan, R Tao, C Xu, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1650-1664, 2022
Cited by: 42, Year: 2022
Representation learning with spectro-temporal-channel attention for speech emotion recognition
L Guo, L Wang, C Xu, J Dang, ES Chng, H Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 36, Year: 2021
The 2015 NIST language recognition evaluation: the shared view of I2R, Fantastic4 and SingaMS
KA Lee, H Li, L Deng, V Hautamäki, W Rao, X Xiao, A Larcher, H Sun, ...
Interspeech 2016, 3211-3215, 2016
Cited by: 30, Year: 2016
Universal Speaker Extraction in the Presence and Absence of Target Speakers for Speech of One and Two Talkers.
M Borsdorf, C Xu, H Li, T Schultz
Interspeech, 1469-1473, 2021
Cited by: 28, Year: 2021
Target speaker verification with selective auditory attention for single and multi-talker speech
C Xu, W Rao, J Wu, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2696-2709, 2021
Cited by: 28, Year: 2021
Neural Speaker Extraction with Speaker-Speech Cross-Attention Network.
W Wang, C Xu, M Ge, H Li
Interspeech, 3535-3539, 2021
Cited by: 28, Year: 2021
A bidirectional LSTM approach with word embeddings for sentence boundary detection
C Xu, L Xie, X Xiao
Journal of Signal Processing Systems 90, 1063-1075, 2018
Cited by: 26, Year: 2018
The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016
KA Lee, V Hautamäki, T Kinnunen, A Larcher, C Zhang, A Nautsch, ...
Interspeech, 1328-1332, 2017
Cited by: 26, Year: 2017
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source.
C Xu, X Xiao, S Sun, W Rao, ES Chng, H Li
Interspeech, 1894-1898, 2017
Cited by: 24, Year: 2017