‪Daniel Simig‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	2488	2486
h-index	8	8
i10-index	8	8

0

1200

600

300

900

202220232024197 1161 1114

Daniel Simig

Daniel Simig

Cohere

Verified email at cohere.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Opt: Open pre-trained transformer language models S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ... arXiv preprint arXiv:2205.01068, 2022	2014	2022
Few-shot learning with multilingual language models XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ... arXiv preprint arXiv:2112.10668, 2021	165*	2021
Semdedup: Data-efficient learning at web-scale through semantic deduplication A Abbas, K Tirumala, D Simig, S Ganguli, AS Morcos arXiv preprint arXiv:2303.09540, 2023	86	2023
Opt-iml: Scaling language model instruction meta learning through the lens of generalization S Iyer, XV Lin, R Pasunuru, T Mihaylov, D Simig, P Yu, K Shuster, T Wang, ... arXiv preprint arXiv:2212.12017, 2022	76	2022
MEGABYTE: modeling million-byte sequences with multiscale transformers L Yu, D Simig, C Flaherty, A Aghajanyan, L Zettlemoyer, M Lewis Proceedings of the 37th International Conference on Neural Information …, 2023	60*	2023
D4: Improving llm pretraining via document de-duplication and diversification K Tirumala, D Simig, A Aghajanyan, A Morcos Advances in Neural Information Processing Systems 36, 2024	39	2024
Understanding in-context learning via supportive pretraining data X Han, D Simig, T Mihaylov, Y Tsvetkov, A Celikyilmaz, T Wang arXiv preprint arXiv:2306.15091, 2023	27	2023
Open vocabulary extreme classification using generative models D Simig, F Petroni, P Yanki, K Popat, C Du, S Riedel, M Yazdani arXiv preprint arXiv:2205.05812, 2022	15	2022
Text characterization toolkit (TCT) D Simig, T Wang, V Dankers, P Henderson, K Batsuren, D Hupkes, ... Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the …, 2022	6*	2022
Evaluating end-to-end entity linking on domain-specific knowledge bases: Learning about ancient technologies from museum collections S Cadavid-Sanchez, K Kacem, RAM Frade, J Boehm, T Chaney, ... arXiv preprint arXiv:2305.14588, 2023		2023
Turning Flows into Trees: Graph Analytics for Aerodynamic Flows D Simig, P Kelly		2016
Natural Language to Neural Programs D Simig

The system can't perform the operation now. Try again later.

Articles 1–12