Zongzhang Zhang

Cited by

	All	Since 2019
Citations	1335	1225
h-index	17	16
i10-index	34	31

300

150

225

2013201420152016201720182019202020212022202320247 9 8 23 6 45 85 154 179 253 299 255

Public access

View all

44 articles

1 article

available

not available

Based on funding mandates

Co-authors

Yang YuProfessor, Nanjing UniversityVerified email at nju.edu.cn
Yan Zheng (郑岩)Tianjin UniversityVerified email at tju.edu.cn
Yingfeng Chen(陈赢峰)Fuxi AI Lab in NeteaseVerified email at mail.ustc.edu.cn
Tianpei YangUniversity of AlbertaVerified email at ualberta.ca
Wulong LiuHuawei Noah's Ark LabVerified email at huawei.com
Mykel J. KochenderferAssociate Professor, Stanford UniversityVerified email at stanford.edu
Aijun BaiGoogle ResearchVerified email at google.com
David HsuProfessor of Computer Science, National University of SingaporeVerified email at comp.nus.edu.sg
Wee Sun LeeProfessor, Department of Computer Science, National University of SingaporeVerified email at comp.nus.edu.sg
Yuzheng ZhuangSenior Researcher @ Huawei Noah's Ark LabVerified email at huawei.com
Feng WuAssociate Professor, University of Science and Technology of ChinaVerified email at ustc.edu.cn
Michael LittmanBrown UniversityVerified email at brown.edu
Zhan Wei LimNational University of SingaporeVerified email at comp.nus.edu.sg

Zongzhang Zhang

Nanjing University

Verified email at nju.edu.cn - Homepage

Artificial Intelligence Reinforcement Learning Probabilistic Planning Imitation Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A survey on deep reinforcement learning Q Liu, JW Zhai, ZZ Zhang, S Zhong, Q Zhou, P Zhang, J Xu Chinese Journal of Computers 41 (1), 1-27, 2018	194	2018
深度强化学习综述刘全，翟建伟，章宗长，钟珊，周倩，章鹏，徐进计算机学报 41 (1), 1-27, 2018	106	2018
Weighted double Q-learning Z Zhang, Z Pan, MJ Kochenderfer IJCAI-2017, 3455-3461, 2017	105	2017
A deep Bayesian policy reuse approach against non-stationary agents Y Zheng, Z Meng, J Hao, Z Zhang, T Yang, C Fan NeurIPS-2018, 954-964, 2018	85	2018
Hierarchical deep multiagent reinforcement learning with temporal abstraction H Tang, J Hao, T Lv, Y Chen, Z Zhang, H Jia, C Ren, Y Zheng, Z Meng, ... arXiv preprint arXiv:1809.09332, 2018	77	2018
Weighted double deep multiagent reinforcement learning in stochastic cooperative environments Y Zheng, Z Meng, J Hao, Z Zhang PRICAI-2018, 421-429, 2018	46	2018
A survey on deep reinforcement learning L Quan, Z Jianwei, Z Zongchang, Z Shan, Z Qian Chinese Journal of Computers 41 (01), 1-27, 2018	44	2018
Multi-Agent Incentive Communication via Decentralized Teammate Modeling L Yuan, J Wang, F Zhang, C Wang, Z Zhang, Y Yu, C Zhang AAAI-2022, 9466-9474, 2022	42	2022
Efficient deep reinforcement learning via adaptive policy transfer T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, W Liu, ... IJCAI-2020, 3094-3100, 2020	35	2020
Triple-GAIL: A multi-modal imitation learning framework with generative adversarial Nets C Fei, B Wang, Y Zhuang, Z Zhang, J Hao, H Zhang, X Ji, W Liu IJCAI-2020, 2929-2935, 2020	34	2020
Deep Q-learning with prioritized sampling J Zhai, Q Liu, Z Zhang, S Zhong, H Zhu, P Zhang, C Sun ICONIP-2016, 13-22, 2016	33	2016
Thompson sampling based Monte-Carlo planning in POMDPs A Bai, F Wu, Z Zhang, X Chen ICAPS-2014, 28-36, 2014	28	2014
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy FM Luo, S Jiang, Y Yu, Z Zhang, YF Zhang AAAI-2022, 7637-7646, 2022	27	2022
Covering number for efficient heuristic-based POMDP planning Z Zhang, D Hsu, WS Lee ICML-2014, 28-36, 2014	24	2014
Covering number as a complexity measure for POMDP planning and learning Z Zhang, M Littman, X Chen AAAI-2012, 1853-1859, 2012	23	2012
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data F Zhang, C Jia, YC Li, L Yuan, Y Yu, Z Zhang ICLR-2023, 2023	22	2023
Multi-agent Dynamic Algorithm Configuration K Xue, J Xu, L Yuan, M Li, C Qian, Z Zhang, Y Yu NeurIPS-2022, 20147-20161, 2022	20	2022
Efficient Multi-agent Communication via Self-supervised Information Aggregation C Guan, F Chen, L Yuan, C Wang, H Yin, Z Zhang, Y Yu NeurIPS-2022, 1020-1033, 2022	17	2022
Efficient policy detecting and reusing for non-stationarity in markov games Y Zheng, J Hao, Z Zhang, Z Meng, T Yang, Y Li, C Fan Autonomous Agents and Multi-Agent Systems 35, 1-29, 2021	16	2021
Adaptive Online Packing-guided Search for POMDPs C Wu, G Yang, Z Zhang, Y Yu, D Li, W Liu NeurIPS-2021, 28419-28430, 2021	15	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors