Yuanzhi Li

Cited by

	All	Since 2019
Citations	20520	19620
h-index	51	49
i10-index	91	90

8000

4000

2000

6000

20162017201820192020202120222023202478 175 456 1053 1479 1822 2193 5637 7366

Public access

View all

36 articles

0 articles

available

not available

Based on funding mandates

Yuanzhi Li

Assistant Professor at CMU

Verified email at andrew.cmu.edu - Homepage

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Lora: Low-rank adaptation of large language models EJ Hu, Y Shen, P Wallis, Z Allen-Zhu, Y Li, S Wang, L Wang, W Chen arXiv preprint arXiv:2106.09685, 2021	5825	2021
Sparks of artificial general intelligence: Early experiments with gpt-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023	2665	2023
A convergence theory for deep learning via over-parameterization Z Allen-Zhu, Y Li, Z Song International conference on machine learning, 242-252, 2019	1550	2019
Learning and generalization in overparameterized neural networks, going beyond two layers Z Allen-Zhu, Y Li, Y Liang Advances in neural information processing systems 32, 2019	839	2019
Convergence analysis of two-layer neural networks with relu activation Y Li, Y Yuan Advances in neural information processing systems 30, 2017	747	2017
Learning overparameterized neural networks via stochastic gradient descent on structured data Y Li, Y Liang Advances in neural information processing systems 31, 2018	709	2018
A theoretical analysis of NDCG type ranking measures Y Wang, L Wang, Y Li, D He, TY Liu Conference on learning theory, 25-54, 2013	691	2013
A latent variable model approach to pmi-based word embeddings S Arora, Y Li, Y Liang, T Ma, A Risteski Transactions of the Association for Computational Linguistics 4, 385-399, 2016	646*	2016
Towards understanding ensemble, knowledge distillation and self-distillation in deep learning Z Allen-Zhu, Y Li arXiv preprint arXiv:2012.09816, 2020	363	2020
An alternative view: When does SGD escape local minima? B Kleinberg, Y Li, Y Yuan International conference on machine learning, 2698-2707, 2018	348	2018
Algorithmic regularization in over-parameterized matrix sensing and neural networks with quadratic activations Y Li, T Ma, H Zhang Conference On Learning Theory, 2-47, 2018	338	2018
Towards explaining the regularization effect of initial large learning rate in training neural networks Y Li, C Wei, T Ma Advances in neural information processing systems 32, 2019	327	2019
Textbooks are all you need S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ... arXiv preprint arXiv:2306.11644, 2023	314	2023
Linear algebraic structure of word senses, with applications to polysemy S Arora, Y Li, Y Liang, T Ma, A Risteski Transactions of the Association for Computational Linguistics 6, 483-495, 2018	258	2018
Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma arXiv preprint arXiv:1807.03858, 2018	247	2018
Gradient descent on neural networks typically occurs at the edge of stability J Cohen, S Kaur, Y Li, JZ Kolter, A Talwalkar International Conference on Learning Representations, 2021	216	2021
What can resnet learn efficiently, going beyond kernels? Z Allen-Zhu, Y Li Advances in Neural Information Processing Systems 32, 2019	216	2019
Textbooks are all you need ii: phi-1.5 technical report Y Li, S Bubeck, R Eldan, A Del Giorno, S Gunasekar, YT Lee arXiv preprint arXiv:2309.05463, 2023	201	2023
On the convergence rate of training recurrent neural networks Z Allen-Zhu, Y Li, Z Song Advances in neural information processing systems 32, 2019	199	2019
Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions S Chen, S Chewi, J Li, Y Li, A Salim, AR Zhang arXiv preprint arXiv:2209.11215, 2022	197	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by