Follow
Hayato Futami
Hayato Futami
Sony Group Corporation
Verified email at sony.com
Title
Cited by
Cited by
Year
Distilling the knowledge of BERT for sequence-to-sequence ASR
H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2008.03822, 2020
532020
Asr rescoring and confidence estimation with electra
H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
192021
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2209.04062, 2022
82022
Distilling the Knowledge of BERT for CTC-based ASR
H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2209.02030, 2022
62022
A study on the integration of pipeline and e2e slu systems for spoken semantic parsing toward stop quality challenge
S Arora, H Futami, SL Wu, J Huynh, Y Peng, Y Kashiwagi, E Tsunoo, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
42023
Streaming joint speech recognition and disfluency detection
H Futami, E Tsunoo, K Shibata, Y Kashiwagi, T Okuda, S Arora, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
42023
Joint modelling of spoken language understanding tasks with integrated dialog history
S Arora, H Futami, E Tsunoo, B Yan, S Watanabe
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
32023
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network
S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ...
arXiv preprint arXiv:2310.02973, 2023
22023
Decoder-only architecture for speech recognition with ctc prompts and text data augmentation
E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe
arXiv preprint arXiv:2309.08876, 2023
22023
Integrating pretrained ASR and LM to perform sequence generation for spoken language understanding
S Arora, H Futami, Y Kashiwagi, E Tsunoo, B Yan, S Watanabe
arXiv preprint arXiv:2307.11005, 2023
22023
The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge
H Futami, J Huynh, S Arora, SL Wu, Y Kashiwagi, Y Peng, B Yan, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Tensor decomposition for minimization of E2E SLU model toward on-device processing
Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...
arXiv preprint arXiv:2306.01247, 2023
12023
Phoneme-aware Encoding for Prefix-tree-based Contextual ASR
H Futami, E Tsunoo, Y Kashiwagi, H Ogawa, S Arora, S Watanabe
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Integration of Frame-and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
E Tsunoo, H Futami, Y Kashiwagi, S Arora, S Watanabe
arXiv preprint arXiv:2307.12767, 2023
2023
E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge
Y Kashiwagi, S Arora, H Futami, J Huynh, SL Wu, Y Peng, B Yan, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–15