Exploring the best loss function for DNN-based low-latency speech enhancement with temporal convolutional networks Y Koyama, T Vuong, S Uhlich, B Raj arXiv preprint arXiv:2005.11611, 2020 | 58 | 2020 |
Generalized Spoofing Detection Inspired from Audio Generation Artifacts Y Gao, T Vuong, M Elyasi, G Bharaj, R Singh Interspeech 2021, 2021 | 29 | 2021 |
A modulation-domain loss for neural-network-based real-time speech enhancement T Vuong, Y Xia, RM Stern ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Learnable spectro-temporal receptive fields for robust voice type discrimination T Vuong, Y Xia, R Stern Interspeech 2020, 2020 | 18 | 2020 |
The Application of Learnable STRF Kernels to the 2021 Fearless Steps Phase-03 SAD Challenge. T Vuong, Y Xia, RM Stern Interspeech 2021, 4364-4368, 2021 | 6 | 2021 |
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj Proceedings of the ICML Expressive Vocalizations Workshop and Competition, 2022 | 3 | 2022 |
Improved Modulation-Domain Loss for Neural-Network-based Speech Enhancement T Vuong, RM Stern Interspeech 2022, 2022 | 3 | 2022 |
Natural language person search using deep reinforcement learning A Shah, T Vuong arXiv preprint arXiv:1809.00365, 2018 | 3 | 2018 |
Unsupervised Voice Type Discrimination Score Adaptation Using X-vector Clusters M Lindsey, T Vuong, RM Stern ICASSP 2023 IEEE International Conference on Acoustics, Speech, and Signal …, 2023 | 2 | 2023 |
Investigating the Important Temporal Modulations for Deep-learning-based Speech Activity Detection T Vuong, N Madaan, R Panda, RM Stern 2022 IEEE Workshop on Spoken Language Technology, 2023 | 2 | 2023 |
AdaBERT-CTC: Leveraging BERT-CTC for text-only domain adaptation in ASR T Vuong, K Mundnich, D Bekal, V Elluru, S Ronanki, S Bodapati Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 1 | 2023 |
Incorporating Modulation Information into Deep Neural Networks for Robust Speech Processing TMT Vuong Carnegie Mellon University, 2023 | 1 | 2023 |
L3DAS22: Exploring Loss Functions for 3D Speech Enhancement T Vuong, M Lindsey, Y Xia, R Stern L3DAS22: Machine Learning for 3D Audio Signal Processing, 2022 | 1 | 2022 |