Follow
Tom Bagby
Tom Bagby
Verified email at google.com
Title
Cited by
Cited by
Year
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
6802019
Location-relative attention mechanisms for robust long-form speech synthesis
E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1202020
Semi-supervised generative modeling for controllable speech synthesis
R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ...
arXiv preprint arXiv:1910.01709, 2019
552019
Effective use of variational embedding capacity in expressive end-to-end speech synthesis
E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ...
arXiv preprint arXiv:1906.03402, 2019
532019
Efficient implementation of recurrent neural network transducer in tensorflow
T Bagby, K Rao, KC Sim
2018 IEEE Spoken Language Technology Workshop (SLT), 506-512, 2018
382018
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow.
E Variani, T Bagby, E McDermott, M Bacchiani
Interspeech, 1641-1645, 2017
312017
Speaker generation
D Stanton, M Shannon, S Mariooryad, RJ Skerry-Ryan, E Battenberg, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
272022
Non-saturating GAN training as divergence minimization
M Shannon, B Poole, S Mariooryad, T Bagby, E Battenberg, D Kao, ...
arXiv preprint arXiv:2010.08029, 2020
152020
Complex evolution recurrent neural networks (cernns)
I Shafran, T Bagby, RJ Skerry-Ryan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
122018
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow
KC Sim, A Narayanan, T Bagby, TN Sainath, M Bacchiani
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
122017
Sampled connectionist temporal classification
E Variani, T Bagby, K Lahouel, E McDermott, M Bacchiani
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
82018
Learning the joint distribution of two sequences using little or no paired data
S Mariooryad, M Shannon, S Ma, T Bagby, D Kao, D Stanton, ...
arXiv preprint arXiv:2212.03232, 2022
12022
Generative semi-supervised learning with a neural seq2seq noisy channel
S Mariooryad, M Shannon, S Ma, T Bagby, DTH Kao, D Stanton, ...
ICML 2023 Workshop on Structured Probabilistic Inference {\&} Generative …, 2023
2023
Last: Scalable Lattice-Based Speech Modelling in Jax
K Wu, E Variani, T Bagby, M Riley
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–14