Sparks of artificial general intelligence: Early experiments with gpt-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023 | 3215 | 2023 |
The power of depth for feedforward neural networks R Eldan, O Shamir Conference on learning theory, 907-940, 2016 | 1014 | 2016 |
Phi-3 technical report: A highly capable language model locally on your phone M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ... arXiv preprint arXiv:2404.14219, 2024 | 509 | 2024 |
Textbooks are all you need S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ... arXiv preprint arXiv:2306.11644, 2023 | 477 | 2023 |
Sparks of artificial general intelligence: Early experiments with GPT-4. arXiv S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023 | 357 | 2023 |
Textbooks are all you need ii: phi-1.5 technical report Y Li, S Bubeck, R Eldan, A Del Giorno, S Gunasekar, YT Lee arXiv preprint arXiv:2309.05463, 2023 | 338 | 2023 |
Kernel-based methods for bandit convex optimization S Bubeck, R Eldan, YT Lee Journal of the ACM (JACM) 68 (4), 1-35, 2021 | 187 | 2021 |
Testing for high‐dimensional geometry in random graphs S Bubeck, J Ding, R Eldan, MZ Rácz Random Structures & Algorithms 49 (3), 503-532, 2016 | 166 | 2016 |
Phi-2: The surprising power of small language models M Javaheripi, S Bubeck, M Abdin, J Aneja, S Bubeck, CCT Mendes, ... Microsoft Research Blog 1 (3), 3, 2023 | 165 | 2023 |
Tinystories: How small can language models be and still speak coherent english? R Eldan, Y Li arXiv preprint arXiv:2305.07759, 2023 | 161 | 2023 |
Sampling from a log-concave distribution with projected Langevin Monte Carlo S Bubeck, R Eldan, J Lehec Discrete & Computational Geometry 59, 757-783, 2018 | 160 | 2018 |
Thin shell implies spectral gap up to polylog via a stochastic localization scheme R Eldan Geometric and Functional Analysis 23 (2), 532-569, 2013 | 155 | 2013 |
Who's Harry Potter? Approximate Unlearning in LLMs R Eldan, M Russinovich arXiv preprint arXiv:2310.02238, 2023 | 122 | 2023 |
Gaussian-width gradient complexity, reverse log-Sobolev inequalities and nonlinear large deviations R Eldan Geometric and Functional Analysis 28 (6), 1548-1596, 2018 | 95 | 2018 |
A two-sided estimate for the Gaussian noise stability deficit R Eldan Inventiones mathematicae 201, 561-624, 2015 | 87 | 2015 |
Multi-scale exploration of convex functions and bandit convex optimization S Bubeck, R Eldan Conference on Learning Theory, 583-589, 2016 | 84 | 2016 |
Localization schemes: A framework for proving mixing bounds for Markov chains Y Chen, R Eldan 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS …, 2022 | 82 | 2022 |
Approximately gaussian marginals and the hyperplane conjecture R Eldan, B Klartag Concentration, functional inequalities and isoperimetry 545, 55-68, 2011 | 73 | 2011 |
Unveiling transformers with lego: a synthetic reasoning task Y Zhang, A Backurs, S Bubeck, R Eldan, S Gunasekar, T Wagner arXiv preprint arXiv:2206.04301, 2022 | 69 | 2022 |
Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv 2023 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712 10, 0 | 68 | |