Speech dereverberation based on variance-normalized delayed linear prediction T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang IEEE Transactions on Audio, Speech, and Language Processing 18 (7), 1717-1731, 2010 | 506 | 2010 |
The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, E Habets, ... 2013 IEEE Workshop on Applications of Signal Processing to Audio and …, 2013 | 473 | 2013 |
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research K Kinoshita, M Delcroix, S Gannot, E Habets, R Haeb-Umbach, ... EURASIP Journal on Advances in Signal Processing, 2016 | 423 | 2016 |
Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition T Yoshioka, A Sehr, M Delcroix, K Kinoshita, R Maas, T Nakatani, ... IEEE Signal Processing Magazine 29 (6), 114-126, 2012 | 338 | 2012 |
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 269 | 2015 |
Suppression of late reverberation effect on speech signal using long-term multiple-step linear prediction K Kinoshita, M Delcroix, T Nakatani, M Miyoshi IEEE transactions on audio, speech, and language processing 17 (4), 534-545, 2009 | 253 | 2009 |
Speakerbeam: Speaker aware neural network for target speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Ochiai, T Nakatani, L Burget, ... IEEE Journal of Selected Topics in Signal Processing 13 (4), 800-814, 2019 | 227 | 2019 |
Single channel target speaker extraction and recognition with speaker beam M Delcroix, K Zmolikova, K Kinoshita, A Ogawa, T Nakatani 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 214 | 2018 |
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 195 | 2008 |
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ... Reverb workshop, 2014 | 128 | 2014 |
A multichannel MMSE-based framework for speech source separation and noise reduction M Souden, S Araki, K Kinoshita, T Nakatani, H Sawada IEEE Transactions on Audio, Speech, and Language Processing 21 (9), 1913-1928, 2013 | 127 | 2013 |
Speaker-aware neural network based beamformer for speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani Proc. Interspeech 2017, 2655-2659, 2017 | 123 | 2017 |
Improving speaker discrimination of target speech extraction with time-domain speakerbeam M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 122 | 2020 |
Exploiting spectro-temporal locality in deep learning based acoustic event detection M Espi, M Fujimoto, K Kinoshita, T Nakatani EURASIP Journal on Audio, Speech, and Music Processing 2015, 1-12, 2015 | 118 | 2015 |
Neural Network-Based Spectrum Estimation for Online WPE Dereverberation. K Kinoshita, M Delcroix, H Kwon, T Mori, T Nakatani Interspeech, 384-388, 2017 | 117 | 2017 |
All-neural online source separation, counting, and diarization for meeting analysis T Von Neumann, K Kinoshita, M Delcroix, S Araki, T Nakatani, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 114 | 2019 |
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network K Kinoshita, T Ochiai, M Delcroix, T Nakatani ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020 | 107 | 2020 |
Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera T Hori, S Araki, T Yoshioka, M Fujimoto, S Watanabe, T Oba, A Ogawa, ... IEEE transactions on audio, speech, and language processing 20 (2), 499-513, 2011 | 106 | 2011 |
Harmonicity-based blind dereverberation for single-channel speech signals T Nakatani, K Kinoshita, M Miyoshi IEEE Transactions on Audio, Speech, and Language Processing 15 (1), 80-95, 2006 | 100 | 2006 |
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds K Kinoshita, M Delcroix, N Tawara ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 95 | 2021 |