Tyler Vuong
Verified email at andrew.cmu.edu
Title
Cited by
Year
Exploring the best loss function for DNN-based low-latency speech enhancement with temporal convolutional networks
Y Koyama, T Vuong, S Uhlich, B Raj
arXiv preprint arXiv:2005.11611, 2020
Cited by: 58 | Year: 2020
Generalized Spoofing Detection Inspired from Audio Generation Artifacts
Y Gao, T Vuong, M Elyasi, G Bharaj, R Singh
Interspeech 2021, 2021
Cited by: 29 | Year: 2021
A modulation-domain loss for neural-network-based real-time speech enhancement
T Vuong, Y Xia, RM Stern
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 21 | Year: 2021
Learnable spectro-temporal receptive fields for robust voice type discrimination
T Vuong, Y Xia, R Stern
Interspeech 2020, 2020
Cited by: 18 | Year: 2020
The Application of Learnable STRF Kernels to the 2021 Fearless Steps Phase-03 SAD Challenge.
T Vuong, Y Xia, RM Stern
Interspeech 2021, 4364-4368, 2021
Cited by: 6 | Year: 2021
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction
R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj
Proceedings of the ICML Expressive Vocalizations Workshop and Competition, 2022
Cited by: 3 | Year: 2022
Improved Modulation-Domain Loss for Neural-Network-based Speech Enhancement
T Vuong, RM Stern
Interspeech 2022, 2022
Cited by: 3 | Year: 2022
Natural language person search using deep reinforcement learning
A Shah, T Vuong
arXiv preprint arXiv:1809.00365, 2018
Cited by: 3 | Year: 2018
Unsupervised Voice Type Discrimination Score Adaptation Using X-vector Clusters
M Lindsey, T Vuong, RM Stern
ICASSP 2023 IEEE International Conference on Acoustics, Speech, and Signal …, 2023
Cited by: 2 | Year: 2023
Investigating the Important Temporal Modulations for Deep-learning-based Speech Activity Detection
T Vuong, N Madaan, R Panda, RM Stern
2022 IEEE Workshop on Spoken Language Technology, 2023
Cited by: 2 | Year: 2023
AdaBERT-CTC: Leveraging BERT-CTC for text-only domain adaptation in ASR
T Vuong, K Mundnich, D Bekal, V Elluru, S Ronanki, S Bodapati
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
Cited by: 1 | Year: 2023
Incorporating Modulation Information into Deep Neural Networks for Robust Speech Processing
TMT Vuong
Carnegie Mellon University, 2023
Cited by: 1 | Year: 2023
L3DAS22: Exploring Loss Functions for 3D Speech Enhancement
T Vuong, M Lindsey, Y Xia, R Stern
L3DAS22: Machine Learning for 3D Audio Signal Processing, 2022
Cited by: 1 | Year: 2022
Articles 1–13