Large context end-to-end automatic speech recognition via extension of hierarchical recurrent encoder-decoder models R Masumura, T Tanaka, T Moriya, Y Shinohara, T Oba, Y Aono ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 33 | 2019 |
Automation of system building for state-of-the-art large vocabulary speech recognition using evolution strategy T Moriya, T Tanaka, T Shinozaki, S Watanabe, K Duh 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 33 | 2015 |
Deep versus wide: An analysis of student architectures for task-agnostic knowledge distillation of self-supervised speech models T Ashihara, T Moriya, K Matsuura, T Tanaka arXiv preprint arXiv:2207.06867, 2022 | 28 | 2022 |
Learning to enhance or not: Neural network-based switching of enhanced and observed signals for overlapping speech recognition H Sato, T Ochiai, M Delcroix, K Kinoshita, N Kamo, T Moriya ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 27 | 2022 |
Self-Distillation for Improving CTC-Transformer-Based ASR Systems. T Moriya, T Ochiai, S Karita, H Sato, T Tanaka, T Ashihara, R Masumura, ... INTERSPEECH, 546-550, 2020 | 24 | 2020 |
Sequence-level consistency training for semi-supervised end-to-end automatic speech recognition R Masumura, M Ihori, A Takashima, T Moriya, A Ando, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 24 | 2020 |
Should we always separate?: Switching between enhanced and observed signals for overlapping speech recognition H Sato, T Ochiai, M Delcroix, K Kinoshita, T Moriya, N Kamo arXiv preprint arXiv:2106.00949, 2021 | 22 | 2021 |
Streaming target-speaker ASR with neural transducer T Moriya, H Sato, T Ochiai, M Delcroix, T Shinozaki arXiv preprint arXiv:2209.04175, 2022 | 20 | 2022 |
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. T Tanaka, R Masumura, T Moriya, T Oba, Y Aono INTERSPEECH, 2210-2214, 2019 | 20 | 2019 |
Automated structure discovery and parameter tuning of neural network language model based on evolution strategy T Tanaka, T Moriya, T Shinozaki, S Watanabe, T Hori, K Duh 2016 IEEE Spoken Language Technology Workshop (SLT), 665-671, 2016 | 20 | 2016 |
Evolution-strategy-based automation of system development for high-performance speech recognition T Moriya, T Tanaka, T Shinozaki, S Watanabe, K Duh IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (1), 77-88, 2018 | 18 | 2018 |
Kaldi recipe for Japanese spontaneous speech recognition and its evaluation T Moriya, T Shinozaki, S Watanabe Autumn Meeting of ASJ, 7, 2015 | 17 | 2015 |
Leveraging large text corpora for end-to-end speech summarization K Matsuura, T Ashihara, T Moriya, T Tanaka, A Ogawa, M Delcroix, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 15 | 2023 |
Distilling attention weights for CTC-based ASR systems T Moriya, H Sato, T Tanaka, T Ashihara, R Masumura, Y Shinohara ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 15 | 2020 |
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition. S Ueno, T Moriya, M Mimura, S Sakai, Y Shinohara, Y Yamaguchi, ... INTERSPEECH, 2424-2428, 2018 | 13 | 2018 |
SpeechGLUE: How well can self-supervised speech models capture linguistic knowledge? T Ashihara, T Moriya, K Matsuura, T Tanaka, Y Ijima, T Asami, M Delcroix, ... arXiv preprint arXiv:2306.08374, 2023 | 12 | 2023 |
Neural speech-to-text language models for rescoring hypotheses of DNN-HMM hybrid automatic speech recognition systems T Tanaka, R Masumura, T Moriya, Y Aono 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 12 | 2018 |
Evolutionary optimization of long short-term memory neural network language model T Tanaka, T Moriya, T Shinozaki, S Watanabe, T Hori, K Duh Journal of the Acoustical Society of America 140 (4_Supplement), 3062-3062, 2016 | 12 | 2016 |
Disfluency detection based on speech-aware token-by-token sequence labeling with BLSTM-CRFs and attention mechanisms T Tanaka, R Masumura, T Moriya, T Oba, Y Aono 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 11 | 2019 |
Neural Whispered Speech Detection with Imbalanced Learning. T Ashihara, Y Shinohara, H Sato, T Moriya, K Matsui, T Fukutomi, ... INTERSPEECH, 3352-3356, 2019 | 11 | 2019 |