Yusuke Fujita
LY Corp.
Verified email at linecorp.com
Title
Cited by
Year
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
Cited by 328 (2020)
End-to-end neural speaker diarization with self-attention
Y Fujita, N Kanda, S Horiguchi, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by 271 (2019)
End-to-end neural speaker diarization with permutation-free objectives
Y Fujita, N Kanda, S Horiguchi, K Nagamatsu, S Watanabe
Interspeech, 4300-4304, 2019
Cited by 265 (2019)
End-to-end speaker diarization for an unknown number of speakers with encoder-decoder based attractors
S Horiguchi, Y Fujita, S Watanabe, Y Xue, K Nagamatsu
arXiv preprint arXiv:2005.09921, 2020
Cited by 190 (2020)
Speaker diarization with region proposal network
Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 76 (2020)
Guided source separation meets a strong ASR backend: Hitachi/Paderborn University joint investigation for dinner party ASR
N Kanda, C Boeddeker, J Heitkaemper, Y Fujita, S Horiguchi, ...
arXiv preprint arXiv:1905.12230, 2019
Cited by 72 (2019)
Encoder-decoder based attractors for end-to-end neural diarization
S Horiguchi, Y Fujita, S Watanabe, Y Xue, P Garcia
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1493-1507, 2022
Cited by 59 (2022)
The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays
N Kanda, R Ikeshita, S Horiguchi, Y Fujita, K Nagamatsu, X Wang, ...
Proc. CHiME-5, 6-10, 2018
Cited by 54 (2018)
Online end-to-end neural diarization with speaker-tracing buffer
Y Xue, S Horiguchi, Y Fujita, S Watanabe, P García, K Nagamatsu
2021 IEEE Spoken Language Technology Workshop (SLT), 841-848, 2021
Cited by 52 (2021)
End-to-end neural diarization: Reformulating speaker diarization as simple multi-label classification
Y Fujita, S Watanabe, S Horiguchi, Y Xue, K Nagamatsu
arXiv preprint arXiv:2003.02966, 2020
Cited by 51 (2020)
Neural speaker diarization with speaker-wise chain rule
Y Fujita, S Watanabe, S Horiguchi, Y Xue, J Shi, K Nagamatsu
arXiv preprint arXiv:2006.01796, 2020
Cited by 46 (2020)
Acoustic modeling for distant multi-talker speech recognition with single-and multi-channel branches
N Kanda, Y Fujita, S Horiguchi, R Ikeshita, K Nagamatsu, S Watanabe
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 45 (2019)
End-to-end speaker diarization as post-processing
S Horiguchi, P Garcia, Y Fujita, S Watanabe, K Nagamatsu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 44 (2021)
Simultaneous speech recognition and speaker diarization for monaural dialogue recordings with target-speaker acoustic models
N Kanda, S Horiguchi, Y Fujita, Y Xue, K Nagamatsu, S Watanabe
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 31-38, 2019
Cited by 43 (2019)
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap
S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ...
arXiv preprint arXiv:2102.01363, 2021
Cited by 41 (2021)
Auxiliary interference speaker loss for target-speaker speech recognition
N Kanda, S Horiguchi, R Takashima, Y Fujita, K Nagamatsu, S Watanabe
arXiv preprint arXiv:1906.10876, 2019
Cited by 36 (2019)
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models.
N Kanda, Y Fujita, K Nagamatsu
Interspeech, 2923-2927, 2018
Cited by 32 (2018)
Acoustic modeling for overlapping speech recognition: JHU CHiME-5 challenge system
V Manohar, SJ Chen, Z Wang, Y Fujita, S Watanabe, S Khudanpur
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by 29 (2019)
Speaker-conditional chain model for speech separation and extraction
J Shi, J Xu, Y Fujita, S Watanabe, B Xu
arXiv preprint arXiv:2006.14149, 2020
Cited by 28 (2020)
Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence
N Kanda, Y Fujita, K Nagamatsu
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 69-76, 2017
Cited by 28 (2017)