Tomohiro Nakatani

Cited by

	All	Since 2019
Citations	11718	6568
h-index	54	40
i10-index	208	118

1400

700

350

1050

1999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202464 27 36 39 36 59 72 81 84 150 158 186 203 273 360 436 408 601 792 923 987 1084 1335 1309 1211 623

Public access

View all

5 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Keisuke KinoshitaResearch Scientist at GoogleVerified email at ieee.org
Marc DelcroixNTT Communication Science LaboratoriesVerified email at ieee.org
Shoko ArakiNTT Communication Science LaboratoriesVerified email at ieee.org
Takuya YoshiokaAssemblyAIVerified email at assemblyai.com
Atsunori OgawaNTT Communication Science LaboratoriesVerified email at ieee.org
Masakiyo FujimotoSenior researcher, National Institute of Information and Communications TechnologyVerified email at nict.go.jp
Nobutaka ItoUniversity of Tokyo, Japan (formerly NTT)Verified email at k.u-tokyo.ac.jp
Rintaro Ikeshita (池下林太郎)NTTVerified email at ieee.org
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Reinhold Haeb-UmbachProfessor of Communications Engineering, University of PaderbornVerified email at nt.uni-paderborn.de
Takuya HiguchiAppleVerified email at apple.com
Hiroshi SawadaNTT CorporationVerified email at lab.ntt.co.jp
Hiroshi G OkunoProfessor Emeritus, Kyoto University, Adjunct Researcher, Waseda UniversityVerified email at nue.org
Toshio IRINOProfessor of Systems Engineering, Wakayama UniversityVerified email at sys.wakayama-u.ac.jp
Atsushi NakamuraGraduate School of Natural Sciences, Nagoya City UniversityVerified email at ieee.org
Mehrez SoudenSr. Manager, Apple Inc.Verified email at gatech.edu
Walter KellermannUniversity Erlangen-NurembergVerified email at LNT.de
Takaaki HoriAppleVerified email at apple.com
Kateřina ŽmolíkováResearch scientist, MetaVerified email at meta.com
Armin SehrOTH RegensburgVerified email at oth-regensburg.de

Tomohiro Nakatani

NTT Communication Science Laboratories

Verified email at ieee.org - Homepage

Statistical audio signal processing Computational auditory scene analysis


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Speech dereverberation based on variance-normalized delayed linear prediction T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang IEEE Transactions on Audio, Speech, and Language Processing 18 (7), 1717-1731, 2010	486	2010
The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, A Sehr, W Kellermann, ... Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE …, 2013	459	2013
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research K Kinoshita, M Delcroix, S Gannot, EA P. Habets, R Haeb-Umbach, ... EURASIP Journal on Advances in Signal Processing 2016, 1-19, 2016	406	2016
Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition T Yoshioka, A Sehr, M Delcroix, K Kinoshita, R Maas, T Nakatani, ... IEEE Signal Processing Magazine 29 (6), 114-126, 2012	331	2012
Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening T Yoshioka, T Nakatani IEEE Transactions on Audio, Speech, and Language Processing 20 (10), 2707-2720, 2012	305	2012
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	261	2015
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise T Higuchi, N Ito, T Yoshioka, T Nakatani 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	258	2016
Improving transformer-based end-to-end speech recognition with connectionist temporal classification and language model integration T Nakatani proc. INTERSPEECH 2019, 1408-1412, 2019	254	2019
Suppression of late reverberation effect on speech signal using long-term multiple-step linear prediction K Kinoshita, M Delcroix, T Nakatani, M Miyoshi IEEE transactions on audio, speech, and language processing 17 (4), 534-545, 2009	252	2009
Speakerbeam: Speaker aware neural network for target speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Ochiai, T Nakatani, L Burget, ... IEEE Journal of Selected Topics in Signal Processing 13 (4), 800-814, 2019	202	2019
Single channel target speaker extraction and recognition with speaker beam M Delcroix, K Zmolikova, K Kinoshita, A Ogawa, T Nakatani 2018 IEEE international conference on acoustics, speech and signal …, 2018	198	2018
Blind separation and dereverberation of speech mixtures by joint optimization T Yoshioka, T Nakatani, M Miyoshi, HG Okuno IEEE Transactions on Audio, Speech, and Language Processing 19 (1), 69-84, 2010	196	2010
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation T Nakatani, T Yoshioka, K Kinoshita, M Miyoshi, BH Juang 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008	188	2008
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques R Haeb-Umbach, S Watanabe, T Nakatani, M Bacchiani, B Hoffmeister, ... IEEE Signal processing magazine 36 (6), 111-124, 2019	174	2019
Exploring multi-channel features for denoising-autoencoder-based speech enhancement S Araki, T Hayashi, M Delcroix, M Fujimoto, K Takeda, T Nakatani 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	131	2015
LINEAR PREDICTION-BASED DEREVERBERATION WITH ADVANCED SPEECH ENHANCEMENT AND RECOGNITION TECHNOLOGIES FOR THE REVERB CHALLENGE M Delcroix, T Yoshioka, A Ogawa, Y Kubo, M Fujimoto, N Ito, K Kinoshita, ...	127	2014
Online MVDR beamformer based on complex Gaussian mixture model with spatial prior for noise robust ASR T Higuchi, N Ito, S Araki, T Yoshioka, M Delcroix, T Nakatani IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (4), 780-793, 2017	124	2017
A multichannel MMSE-based framework for speech source separation and noise reduction M Souden, S Araki, K Kinoshita, T Nakatani, H Sawada IEEE Transactions on Audio, Speech, and Language Processing 21 (9), 1913-1928, 2013	122	2013
Speaker-aware neural network based beamformer for speaker extraction in speech mixtures K Žmolíková, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani Interspeech 2017, 2017	120	2017
Exploiting spectro-temporal locality in deep learning based acoustic event detection M Espi, M Fujimoto, K Kinoshita, T Nakatani EURASIP Journal on Audio, Speech, and Music Processing 2015, 1-12, 2015	114	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors