Wenet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ... arXiv preprint arXiv:2102.01547, 2021 | 247 | 2021 |
Unified streaming and non-streaming two-pass end-to-end model for speech recognition B Zhang, D Wu, Z Yao, X Wang, F Yu, C Yang, L Guo, Y Hu, L Xie, X Lei arXiv preprint arXiv:2012.05481, 2020 | 77 | 2020 |
Adversarial examples for improving end-to-end attention-based small-footprint keyword spotting X Wang, S Sun, C Shan, J Hou, L Xie, S Li, X Lei ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 46 | 2019 |
Accent and speaker disentanglement in many-to-many voice conversion Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 35 | 2021 |
Cascade rnn-transducer: Syllable based streaming on-device mandarin speech recognition with a syllable-to-character converter X Wang, Z Yao, X Shi, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 15-21, 2021 | 31 | 2021 |
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao 2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021 | 19 | 2021 |
Efficient conformer with prob-sparse attention mechanism for end-to-endspeech recognition X Wang, S Sun, L Xie, L Ma arXiv preprint arXiv:2106.09236, 2021 | 18 | 2021 |
Virtual adversarial training for ds-cnn based small-footprint keyword spotting X Wang, S Sun, L Xie 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 12 | 2019 |
Two stage contextual word filtering for context bias in unified streaming and non-streaming transducer Z Yang, S Sun, X Wang, Y Zhang, L Ma, L Xie arXiv preprint arXiv:2301.06735, 2023 | 8 | 2023 |
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer Z Yang, S Sun, J Li, X Zhang, X Wang, L Ma, L Xie arXiv preprint arXiv:2207.01267, 2022 | 8 | 2022 |
Ieee slt 2021 alpha-mini speech challenge: Open datasets, tracks, rules and baselines Y Fu, Z Yao, W He, J Wu, X Wang, Z Yang, S Zhang, L Xie, D Huang, H Bu, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 1101-1108, 2021 | 5 | 2021 |
VITA: Towards Open-Source Interactive Omni Multimodal LLM C Fu, H Lin, Z Long, Y Shen, M Zhao, Y Zhang, X Wang, D Yin, L Ma, ... arXiv preprint arXiv:2408.05211, 2024 | 4 | 2024 |
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting S Lv, X Wang, S Sun, L Ma, L Xie arXiv preprint arXiv:2305.12331, 2023 | 2 | 2023 |
Minimizing Sequential Confusion Error in Speech Command Recognition Z Yang, H Lv, X Wang, A Zhang, L Xie arXiv preprint arXiv:2207.01261, 2022 | 1 | 2022 |
A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition Y Li, X Wang, S Cao, Y Zhang, L Ma, L Xie arXiv preprint arXiv:2408.09491, 2024 | | 2024 |