Follow
Ahmet Ustun
Ahmet Ustun
Cohere For AI
Verified email at cohere.com - Homepage
Title
Cited by
Cited by
Year
Aya model: An instruction finetuned open-access multilingual language model
A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ...
arXiv preprint arXiv:2402.07827, 2024
1242024
UDapter: Language Adaptation for Truly Universal Dependency Parsing
A Üstün, A Bisazza, G Bouma, G van Noord
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
1242020
Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP
R Van Der Goot, A Üstün, A Ramponi, I Sharaf, B Plank
arXiv preprint arXiv:2005.14672, 2020
1012020
Back to basics: Revisiting reinforce style optimization for learning from human feedback in llms
A Ahmadian, C Cremer, M Gallé, M Fadaee, J Kreutzer, O Pietquin, ...
arXiv preprint arXiv:2402.14740, 2024
862024
Pushing mixture of experts to the limit: Extremely parameter efficient moe for instruction tuning
T Zadouri, A Üstün, A Ahmadian, B Ermiş, A Locatelli, S Hooker
arXiv preprint arXiv:2309.05444, 2023
832023
When less is more: Investigating data pruning for pretraining llms at scale
M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker
arXiv preprint arXiv:2309.04564, 2023
822023
Aya dataset: An open-access collection for multilingual instruction tuning
S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ...
arXiv preprint arXiv:2402.06619, 2024
672024
Aya 23: Open weight releases to further multilingual progress
V Aryabumi, J Dang, D Talupuru, S Dash, D Cairuz, H Lin, B Venkitesh, ...
arXiv preprint arXiv:2405.15032, 2024
612024
Multilingual unsupervised neural machine translation with denoising adapters
A Üstün, A Berard, L Besacier, M Gallé
arXiv preprint arXiv:2110.10472, 2021
462021
Characters or morphemes: How to represent words?
A Üstün, M Kurfalı, B Can
Association for Computational Linguistics, 2018
452018
Siti Oryza Khairunnisa, Mamoru Komachi, and Barbara Plank. 2021. From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language …
R Van Der Goot, I Sharaf, A Imankulova, A Üstün, M Stepanovic, ...
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
422021
Automatic judgement forecasting for pending applications of the European Court of Human Rights
M Medvedeva, A Üstün, X Xu, M Vols, M Wieling
Proceedings of the Fifth Workshop on Automatec Semantic Analysis of …, 2021
332021
Intriguing properties of quantization at scale
A Ahmadian, S Dash, H Chen, B Venkitesh, ZS Gou, P Blunsom, A Üstün, ...
Advances in Neural Information Processing Systems 36, 34278-34294, 2023
322023
Hyper-X: A unified hypernetwork for multi-task multilingual transfer
A Üstün, A Bisazza, G Bouma, G van Noord, S Ruder
arXiv preprint arXiv:2205.12148, 2022
292022
Unsupervised morphological segmentation using neural word embeddings
A Üstün, B Can
Statistical Language and Speech Processing: 4th International Conference …, 2016
202016
From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language understanding
R Van Der Goot, I Sharaf, A Imankulova, A Üstün, M Stepanović, ...
arXiv preprint arXiv:2105.07316, 2021
152021
UDapter: Typology-based language adapters for multilingual dependency parsing and sequence labeling
A Üstün, A Bisazza, G Bouma, G Noord
Computational Linguistics 48 (3), 555-592, 2022
112022
Turkish POS tagging by reducing sparsity with morpheme tags in small datasets
B Can, A Üstün, M Kurfalı
Computational Linguistics and Intelligent Text Processing: 17th …, 2018
112018
On the difficulty of translating free-order case-marking languages
A Bisazza, A Üstün, S Sportel
Transactions of the Association for Computational Linguistics 9, 1233-1248, 2021
102021
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
A Üstün, AC Stickland
arXiv preprint arXiv:2205.11277, 2022
82022
The system can't perform the operation now. Try again later.
Articles 1–20