Florian Metze

Cited by

	All	Since 2019
Citations	11840	6921
h-index	54	38
i10-index	199	124

1700

850

425

1275

2002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202458 87 99 107 133 120 108 137 148 176 236 351 473 562 647 638 743 853 905 1086 1473 1656 940

Public access

View all

45 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Alexander WaibelCarnegie Mellon, KIT, Karlsruhe Institute of Technology, University of KarlsruheVerified email at cs.cmu.edu
Yajie MiaoCarnegie Mellon UniversityVerified email at cs.cmu.edu
Tanja SchultzProfessor of Computer Science, University BremenVerified email at uni-bremen.de
Billy li (Juncheng)Carnegie Mellon UniversityVerified email at cs.cmu.edu
Alan W BlackProfessor, Language Technologies Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Shruti PalaskarAppleVerified email at apple.com
Ramon SanabriaThe University of EdinburghVerified email at ed.ac.uk
Hagen SoltauGoogle DeepMindVerified email at google.com
Siddharth DalmiaResearch Scientist, Google DeepMindVerified email at google.com
Tim PolzehlGerman Research Center for Artificial IntelligenceVerified email at dfki.de
Po-Yao (Bernie) HuangFAIR, MetaVerified email at fb.com
Xinjian LiGoogleVerified email at google.com
Alex HauptmannCarnegie Mellon UniversityVerified email at cs.cmu.edu
Yun Wang (Maigo)Research Scientist at FacebookVerified email at fb.com
Shourabh RawatCarnegie Mellon University (CMU)Verified email at cs.cmu.edu
Shuhui QuStanford UniversityVerified email at stanford.edu
Xavier AngueraELSA Corp.Verified email at elsanow.io
Sebastian StükerZoom Video Communications Inc.Verified email at kit.edu
Thomas SchaafCarnegie Mellon UniversityVerified email at cs.cmu.edu
Christoph FeichtenhoferMeta, FAIRVerified email at fb.com

Florian Metze

Carnegie Mellon University; Meta AI

Verified email at andrew.cmu.edu - Homepage

speech recognition video understanding


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding Y Miao, M Gowayyed, F Metze 2015 IEEE workshop on automatic speech recognition and understanding (ASRU …, 2015	949	2015
Videoclip: Contrastive pre-training for zero-shot video-text understanding H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ... arXiv preprint arXiv:2109.14084, 2021	452	2021
Extracting deep bottleneck features using stacked auto-encoders J Gehring, Y Miao, F Metze, A Waibel 2013 IEEE international conference on acoustics, speech and signal …, 2013	377	2013
Learning joint embedding with multimodal cues for cross-modal video-text retrieval NC Mithun, J Li, F Metze, AK Roy-Chowdhury Proceedings of the 2018 ACM on international conference on multimedia …, 2018	284	2018
How2: a large-scale dataset for multimodal language understanding R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ... arXiv preprint arXiv:1811.00347, 2018	278	2018
Support-set bottlenecks for video-text representation learning M Patrick, PY Huang, Y Asano, F Metze, A Hauptmann, J Henriques, ... arXiv preprint arXiv:2010.02824, 2020	265	2020
A one-pass decoder based on polymorphic linguistic context assignment H Soltau, F Metze, C Fugen, A Waibel IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU …, 2001	251	2001
Keeping your eye on the ball: Trajectory attention in video transformers M Patrick, D Campbell, Y Asano, I Misra, F Metze, C Feichtenhofer, ... Advances in neural information processing systems 34, 12493-12506, 2021	242	2021
A comparison of five multiple instance learning pooling functions for sound event detection with weak labeling Y Wang, J Li, F Metze ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	204	2019
Comparison of four approaches to age and gender recognition for telephone applications F Metze, J Ajmera, R Englert, U Bub, F Burkhardt, J Stegmann, C Muller, ... 2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007	196	2007
A comparison of deep learning methods for environmental sound detection J Li, W Dai, F Metze, S Qu, S Das 2017 IEEE International conference on acoustics, speech and signal …, 2017	185	2017
Advances in automatic meeting record creation and access A Waibel, M Bett, F Metze, K Ries, T Schaaf, T Schultz, H Soltau, H Yu, ... 2001 IEEE International Conference on Acoustics, Speech, and Signal …, 2001	180	2001
Masked autoencoders that listen PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ... Advances in Neural Information Processing Systems 35, 28708-28720, 2022	169	2022
How2sign: a large-scale multimodal dataset for continuous american sign language A Duarte, S Palaskar, L Ventura, D Ghadiyaram, K DeHaan, F Metze, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	167	2021
Session independent non-audible speech recognition using surface electromyography L Maier-Hein, F Metze, T Schultz, A Waibel IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 331-336, 2005	167	2005
Speaker adaptive training of deep neural network acoustic models using i-vectors Y Miao, H Zhang, F Metze IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (11 …, 2015	144	2015
Effective dimensionality reduction for word embeddings V Raunak, V Gupta, F Metze Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP …, 2019	129	2019
Anger recognition in speech using acoustic and linguistic cues T Polzehl, A Schmitt, F Metze, M Wagner Speech Communication 53 (9-10), 1198-1209, 2011	127	2011
Deep maxout networks for low-resource speech recognition Y Miao, F Metze, S Rawat 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 398-403, 2013	126	2013
Vlm: Task-agnostic video-language model pre-training for video understanding H Xu, G Ghosh, PY Huang, P Arora, M Aminzadeh, C Feichtenhofer, ... arXiv preprint arXiv:2105.09996, 2021	124	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors