Temporal action detection using a statistical language model A Richard, J Gall Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 257 | 2016 |
Weakly supervised action learning with rnn based fine-to-coarse modeling A Richard, H Kuehne, J Gall Proceedings of the IEEE conference on Computer Vision and Pattern …, 2017 | 247 | 2017 |
When will you do what?-anticipating temporal occurrences of activities Y Abu Farha, A Richard, J Gall Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 215 | 2018 |
Meshtalk: 3d face animation from speech using cross-modality disentanglement A Richard, M Zollhöfer, Y Wen, F De la Torre, Y Sheikh Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 192 | 2021 |
Neuralnetwork-viterbi: A framework for weakly supervised video learning A Richard, H Kuehne, A Iqbal, J Gall Proceedings of the IEEE conference on Computer Vision and Pattern …, 2018 | 159 | 2018 |
Conditional diffusion probabilistic model for speech enhancement YJ Lu, ZQ Wang, S Watanabe, A Richard, C Yu, Y Tsao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 147 | 2022 |
Weakly supervised learning of actions from transcripts H Kuehne, A Richard, J Gall Computer Vision and Image Understanding 163, 78-89, 2017 | 140 | 2017 |
Action sets: Weakly supervised action segmentation without ordering constraints A Richard, H Kuehne, J Gall Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 106 | 2018 |
A hybrid rnn-hmm approach for weakly supervised temporal action segmentation H Kuehne, A Richard, J Gall IEEE transactions on pattern analysis and machine intelligence 42 (4), 765-779, 2018 | 102 | 2018 |
Mean-normalized stochastic gradient for large-scale deep learning S Wiesler, A Richard, R Schlüter, H Ney 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 85 | 2014 |
Audio-and gaze-driven facial animation of codec avatars A Richard, C Lea, S Ma, J Gall, F De la Torre, Y Sheikh Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021 | 83 | 2021 |
RASR/NN: The RWTH neural network toolkit for speech recognition S Wiesler, A Richard, P Golik, R Schlüter, H Ney 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 61 | 2014 |
Neural Synthesis of Binaural Speech From Mono Audio A Richard, D Markovic, ID Gebru, S Krenn, GA Butler, F Torre, Y Sheikh International Conference on Learning Representations, 2021 | 59 | 2021 |
Multiface: A dataset for neural face rendering C Wuu, N Zheng, S Ardisson, R Bali, D Belko, E Brockmeyer, L Evans, ... arXiv preprint arXiv:2207.11243, 2022 | 58 | 2022 |
Audiodec: An open-source streaming high-fidelity neural audio codec YC Wu, ID Gebru, D Marković, A Richard ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 50 | 2023 |
A bag-of-words equivalent recurrent neural network for action recognition A Richard, J Gall Computer Vision and Image Understanding 156, 79-91, 2017 | 49 | 2017 |
Audio-visual speech codecs: Rethinking audio-visual speech enhancement by re-synthesis K Yang, D Marković, S Krenn, V Agrawal, A Richard Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 37 | 2022 |
Deep impulse responses: Estimating and parameterizing filters with deep networks A Richard, P Dodds, VK Ithapu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 34 | 2022 |
Implicit hrtf modeling using temporal convolutional networks ID Gebru, D Marković, A Richard, S Krenn, GA Butler, F De la Torre, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 28 | 2021 |
Novel-view acoustic synthesis C Chen, A Richard, R Shapovalov, VK Ithapu, N Neverova, K Grauman, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 19 | 2023 |