Kaiqing Zhang
Title
Cited by
Cited by
Year
Fully decentralized multi-agent reinforcement learning with networked agents
K Zhang, Z Yang, H Liu, T Zhang, T Başar
International Conference on Machine Learning (ICML), 2018
2802018
Multi-agent reinforcement learning: A selective overview of theories and algorithms
K Zhang, Z Yang, T Başar
Studies in Systems, Decision and Control, Handbook on RL and Control, 2019
2602019
Global convergence of policy gradient methods to (almost) locally optimal policies
K Zhang, A Koppel, H Zhu, T Başar
SIAM Journal on Control and Optimization (SICON), 2019
75*2019
Dependency analysis and improved parameter estimation for dynamic composite load modeling
K Zhang, H Zhu, S Guo
IEEE Transactions on Power Systems 32 (4), 3287-3297, 2016
722016
Policy optimization provably converges to Nash equilibria in zero-sum linear quadratic games
K Zhang, Z Yang, T Basar
Advances in Neural Information Processing Systems, 11602-11614, 2019
652019
Networked multi-agent reinforcement learning in continuous spaces
K Zhang, Z Yang, T Basar
2018 IEEE Conference on Decision and Control (CDC), 2771-2776, 2018
572018
Policy optimization for linear control with robustness guarantee: Implicit regularization and global convergence
K Zhang, B Hu, T Başar
SIAM Journal on Control and Optimization (SICON), 2021
482021
Dynamic power distribution system management with a locally connected communication network
K Zhang, W Shi, H Zhu, E Dall'Anese, T Basar
IEEE Journal of Selected Topics in Signal Processing (JSTSP), 2018
42*2018
Finite-sample analysis for decentralized batch multi-agent reinforcement learning with networked agents
K Zhang, Z Yang, H Liu, T Zhang, T Başar
IEEE Transactions on Automatic Control, 2018
41*2018
Communication-efficient policy gradient methods for distributed reinforcement learning
T Chen, K Zhang, GB Giannakis, T Başar
IEEE Transactions on Control of Network Systems, 2018
40*2018
Machine learning techniques for spectrum sensing when primary user has multiple transmit powers
K Zhang, J Li, F Gao
2014 IEEE International Conference on Communication Systems, 137-141, 2014
38*2014
A multi-agent off-policy actor-critic algorithm for distributed reinforcement learning
W Suttle, Z Yang, K Zhang, Z Wang, T Başar, J Liu
IFAC-PapersOnLine 53 (2), 1549-1554, 2020
34*2020
Model-based multi-agent RL in zero-sum Markov games with near-optimal sample complexity
K Zhang, S Kakade, T Başar, L Yang
arXiv preprint arXiv:2007.07461, 2020
312020
Consumption behavior analytics-aided energy forecasting and dispatch
Y Zhang, R Yang, K Zhang, H Jiang, JJ Zhang
IEEE Intelligent Systems 32 (4), 59-63, 2017
272017
Natural policy gradient primal-dual method for constrained Markov decision processes
D Ding, K Zhang, T Basar, M Jovanovic
Advances in Neural Information Processing Systems 33, 2020
222020
Optimal joint bidding and pricing of profit-seeking load serving entity
H Xu, K Zhang, J Zhang
IEEE Transactions on Power Systems 33 (5), 5427-5436, 2018
202018
An improved analysis of (variance-reduced) policy gradient and natural policy gradient methods
Y Liu, K Zhang, T Basar, W Yin
Advances in Neural Information Processing Systems 33, 2020
162020
Projected stochastic primal-dual method for constrained online learning with kernels
A Koppel, K Zhang, H Zhu, TM Baser
IEEE Transactions on Signal Processing, 2018
152018
A finite sample analysis of the actor-critic algorithm
Z Yang, K Zhang, M Hong, T Başar
2018 IEEE Conference on Decision and Control (CDC), 2759-2764, 2018
142018
On the performance of map-aware cooperative localization
K Zhang, Y Shen, MZ Win
2016 IEEE International Conference on Communications (ICC), 1-6, 2016
14*2016
The system can't perform the operation now. Try again later.
Articles 1–20