Mengdi Wang

Cited by

	All	Since 2019
Citations	5785	5408
h-index	42	42
i10-index	86	82

Citations per year
Year	Citations
2015	23
2016	44
2017	112
2018	153
2019	263
2020	499
2021	917
2022	1107
2023	1444
2024	1177

Public access

45 articles available

0 articles not available

Based on funding mandates

Co-authors

Lin F. Yang (杨林): Assistant Professor, Department of Electrical and Computer Engineering @ UCLA. Verified email at ee.ucla.edu
Alec Koppel: AI Research Lead, JP Morgan AI Research. Verified email at jpmchase.com
Csaba Szepesvari: DeepMind & University of Alberta. Verified email at cs.ualberta.ca
Yinyu Ye: K.T. Li Professor of Engineering, Stanford University. Verified email at stanford.edu
Tuo Zhao: Assistant Professor, Georgia Tech. Verified email at gatech.edu
Dimitri Bertsekas: Arizona State University - Massachusetts Institute of Technology. Verified email at mit.edu
Ethan X. Fang: Associate Professor at Duke University. Verified email at duke.edu
Aaron Sidford: Stanford University. Verified email at stanford.edu
Botao Hao: OpenAI. Verified email at openai.com
Anru Zhang: Duke University. Verified email at duke.edu
Michael I. Jordan: Professor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC Berkeley. Verified email at cs.berkeley.edu
Zhaoran Wang: Assistant Professor at Northwestern University. Verified email at northwestern.edu
Yu-Xiang Wang: Associate Professor @ UC San Diego. Verified email at ucsd.edu
Tong Zhang: UIUC. Verified email at tongzhang-ml.org
Prateek Mittal: Professor, Princeton University. Verified email at princeton.edu
Saeed Ghadimi: University of Waterloo. Verified email at uwaterloo.ca
Tor Lattimore: DeepMind. Verified email at google.com
Andrzej Ruszczyński: Board of Governors Professor of Rutgers University. Verified email at business.rutgers.edu
Zheng Tracy Ke: Harvard University. Verified email at fas.harvard.edu
Lihong Li (李力鸿): Amazon. Verified email at amazon.com

Mengdi Wang

Center for Statistics & Machine Learning, ECE, Princeton University

Verified email at princeton.edu

reinforcement learning, optimization, machine learning, data science, control


Title	Cited by	Year
Sample-optimal parametric Q-learning using linearly additive features. L Yang, M Wang. International Conference on Machine Learning, 6995-7004, 2019	354	2019
Model-based reinforcement learning with value-targeted regression. A Ayoub, Z Jia, C Szepesvari, M Wang, L Yang. International Conference on Machine Learning, 463-474, 2020	319	2020
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound. LF Yang, M Wang. International Conference on Machine Learning, 2020, 2019	314	2019
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions. M Wang, EX Fang, H Liu. Mathematical Programming 161, 419-449, 2017	271	2017
Near-optimal time and sample complexities for solving Markov decision processes with a generative model. A Sidford, M Wang, X Wu, L Yang, Y Ye. Advances in Neural Information Processing Systems 31, 2018	254	2018
Approximation methods for bilevel programming. S Ghadimi, M Wang. arXiv preprint arXiv:1802.02246, 2018	235	2018
Minimax-optimal off-policy evaluation with linear function approximation. Y Duan, Z Jia, M Wang. International Conference on Machine Learning, 2701-2709, 2020	161	2020
Accelerating stochastic composition optimization. M Wang, J Liu, EX Fang. Journal of Machine Learning Research, 2017, 2016	154	2016
Variance reduced value iteration and faster algorithms for solving Markov decision processes. A Sidford, M Wang, X Wu, Y Ye. Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete …, 2017	140*	2017
Variational policy gradient method for reinforcement learning with general utilities. J Zhang, A Koppel, AS Bedi, C Szepesvari, M Wang. Advances in Neural Information Processing Systems 2020, 2020	135	2020
A single timescale stochastic approximation method for nested stochastic optimization. S Ghadimi, A Ruszczynski, M Wang. SIAM Journal on Optimization 30 (1), 960-979, 2020	123	2020
Stochastic first-order methods with random constraint projection. M Wang, DP Bertsekas. SIAM Journal on Optimization 26 (1), 681-717, 2016	120*	2016
Visual adversarial examples jailbreak aligned large language models. X Qi, K Huang, A Panda, P Henderson, M Wang, P Mittal. Proceedings of the AAAI Conference on Artificial Intelligence 38 (19), 21527 …, 2024	106*	2024
On function approximation in reinforcement learning: Optimism in the face of large state spaces. Z Yang, C Jin, Z Wang, M Wang, MI Jordan. arXiv preprint arXiv:2011.04622, 2020	98*	2020
Towards compact CNNs via collaborative compression. Y Li, S Lin, J Liu, Q Ye, M Wang, F Chao, F Yang, J Ma, Q Tian, R Ji. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	95	2021
Finite-sum composition optimization via variance reduced gradient descent. X Lian, M Wang, J Liu. Artificial Intelligence and Statistics. 2017., 2016	95	2016
Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data. M Chen, K Huang, T Zhao, M Wang. International Conference on Machine Learning, 2024	83	2024
Solving discounted stochastic two-player games with near-optimal time and sample complexity. A Sidford, M Wang, L Yang, Y Ye. International Conference on Artificial Intelligence and Statistics, 2992-3002, 2020	82	2020
Randomized linear programming solves the Markov decision problem in nearly linear (sometimes sublinear) time. M Wang. Mathematics of Operations Research 45 (2), 517-546, 2020	80*	2020
A distributed tracking algorithm for reconstruction of graph signals. X Wang, M Wang, Y Gu. IEEE Journal of Selected Topics in Signal Processing 9 (4), 728-740, 2015	79	2015
