Exploration in deep reinforcement learning: a comprehensive survey T Yang, H Tang, C Bai, J Liu, J Hao, Z Meng, P Liu, Z Wang arXiv preprint arXiv:2109.06668, 2021 | 92* | 2021 |
From few to more: Large-scale dynamic multiagent curriculum learning W Wang *, T Yang*, Y Liu*, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7293-7300, 2020 | 83 | 2020 |
A deep bayesian policy reuse approach against non-stationary agents Y Zheng, Z Meng, J Hao, Z Zhang, T Yang, C Fan Advances in neural information processing systems 31, 2018 | 75 | 2018 |
Towards efficient detection and optimal response against sophisticated opponents T Yang, Z Meng, J Hao, C Zhang, Y Zheng, Z Zheng Proceedings of the 28th International Joint Conference on Artificial …, 2018 | 37 | 2018 |
A survey on interpretable reinforcement learning C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu arXiv preprint arXiv:2112.13112, 2021 | 30 | 2021 |
Action semantics network: Considering the effects of actions in multiagent systems W Wang*, T Yang*, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao Proceedings of the 8th International Conference on Learning Representations, 2019 | 26 | 2019 |
Efficient deep reinforcement learning via adaptive policy transfer T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Cheng, C Fan, W Wang, W Liu, ... Proceedings of the Twenty-Ninth International Joint Conference on Artificial …, 2020 | 25 | 2020 |
An efficient transfer learning framework for multiagent reinforcement learning T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ... Advances in Neural Information Processing Systems 34, 17037-17048, 2021 | 22* | 2021 |
Learning action-transferable policy with action embedding Y Chen, Y Chen, Z Hu, T Yang, C Fan, Y Yu, J Hao arXiv preprint arXiv:1909.02291, 2019 | 17 | 2019 |
Efficient policy detecting and reusing for non-stationarity in markov games Y Zheng, J Hao, Z Zhang, Z Meng, T Yang, Y Li, C Fan Autonomous Agents and Multi-Agent Systems 35, 1-29, 2021 | 14 | 2021 |
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents. T Yang, J Hao, Z Meng, Y Zheng, C Zhang, Z Zheng AAMAS, 2282-2284, 2019 | 14 | 2019 |
Accelerating Norm Emergence Through Hierarchical Heuristic Learning. T Yang, Z Meng, J Hao, S Sen, C Yu Proceedings of 22nd European Conference on Artificial Intelligence (ECAI …, 2016 | 14 | 2016 |
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration P Li, H Tang, T Yang, X Hao, T Sang, Y Zheng, J Hao, ME Taylor, Z Wang International Conference on Machine Learning 162, 12979-12997, 2022 | 11 | 2022 |
Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions C Zhang, Y Tian, Z Zhang, W Xue, X Xie, T Yang, X Ge, R Chen IEEE Transactions on Intelligent Transportation Systems 23 (12), 25157-25168, 2022 | 10 | 2022 |
Efficient Deep Reinforcement Learning through Policy Transfer. T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, Z Wang, ... AAMAS, 2053-2055, 2020 | 8 | 2020 |
Cross-domain Adaptive Transfer Reinforcement Learning Based on State-Action Correspondence H You, T Yang, Y Zheng, J Hao, ME Taylor The 38th Conference on Uncertainty in Artificial Intelligence, 2022 | 4 | 2022 |
Advertising Impression Resource Allocation Strategy with Multi-Level Budget Constraint DQN in Real-Time Bidding C Zhang, K Zheng, Y Tian, W Xue, T Yang, D An, Y Pi, R Chen Neurocomputing 488, 647-656, 2022 | 3 | 2022 |
NeurIPS 2022 Competition: Driving SMARTS A Rasouli, R Goebel, ME Taylor, I Kotseruba, S Alizadeh, T Yang, ... arXiv preprint arXiv:2211.07545, 2022 | 2 | 2022 |
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis Y Cao, Z Li, T Yang, H Zhang, Y Zheng, Y Li, J Hao, Y Liu Advances in Neural Information Processing Systems 35, 19930-19943, 2022 | 2 | 2022 |
Learning to shape rewards using a game of two partners D Mguni, T Jafferjee, J Wang, N Perez-Nieves, W Song, F Tong, M Taylor, ... Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11604 …, 2023 | 1 | 2023 |