Follow
Tianpei Yang
Title
Cited by
Cited by
Year
Exploration in deep reinforcement learning: a comprehensive survey
T Yang, H Tang, C Bai, J Liu, J Hao, Z Meng, P Liu, Z Wang
arXiv preprint arXiv:2109.06668, 2021
92*2021
From few to more: Large-scale dynamic multiagent curriculum learning
W Wang *, T Yang*, Y Liu*, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7293-7300, 2020
832020
A deep bayesian policy reuse approach against non-stationary agents
Y Zheng, Z Meng, J Hao, Z Zhang, T Yang, C Fan
Advances in neural information processing systems 31, 2018
752018
Towards efficient detection and optimal response against sophisticated opponents
T Yang, Z Meng, J Hao, C Zhang, Y Zheng, Z Zheng
Proceedings of the 28th International Joint Conference on Artificial …, 2018
372018
A survey on interpretable reinforcement learning
C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao, W Liu
arXiv preprint arXiv:2112.13112, 2021
302021
Action semantics network: Considering the effects of actions in multiagent systems
W Wang*, T Yang*, Y Liu, J Hao, X Hao, Y Hu, Y Chen, C Fan, Y Gao
Proceedings of the 8th International Conference on Learning Representations, 2019
262019
Efficient deep reinforcement learning via adaptive policy transfer
T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Cheng, C Fan, W Wang, W Liu, ...
Proceedings of the Twenty-Ninth International Joint Conference on Artificial …, 2020
252020
An efficient transfer learning framework for multiagent reinforcement learning
T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ...
Advances in Neural Information Processing Systems 34, 17037-17048, 2021
22*2021
Learning action-transferable policy with action embedding
Y Chen, Y Chen, Z Hu, T Yang, C Fan, Y Yu, J Hao
arXiv preprint arXiv:1909.02291, 2019
172019
Efficient policy detecting and reusing for non-stationarity in markov games
Y Zheng, J Hao, Z Zhang, Z Meng, T Yang, Y Li, C Fan
Autonomous Agents and Multi-Agent Systems 35, 1-29, 2021
142021
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
T Yang, J Hao, Z Meng, Y Zheng, C Zhang, Z Zheng
AAMAS, 2282-2284, 2019
142019
Accelerating Norm Emergence Through Hierarchical Heuristic Learning.
T Yang, Z Meng, J Hao, S Sen, C Yu
Proceedings of 22nd European Conference on Artificial Intelligence (ECAI …, 2016
142016
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
P Li, H Tang, T Yang, X Hao, T Sang, Y Zheng, J Hao, ME Taylor, Z Wang
International Conference on Machine Learning 162, 12979-12997, 2022
112022
Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
C Zhang, Y Tian, Z Zhang, W Xue, X Xie, T Yang, X Ge, R Chen
IEEE Transactions on Intelligent Transportation Systems 23 (12), 25157-25168, 2022
102022
Efficient Deep Reinforcement Learning through Policy Transfer.
T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, Z Wang, ...
AAMAS, 2053-2055, 2020
82020
Cross-domain Adaptive Transfer Reinforcement Learning Based on State-Action Correspondence
H You, T Yang, Y Zheng, J Hao, ME Taylor
The 38th Conference on Uncertainty in Artificial Intelligence, 2022
42022
Advertising Impression Resource Allocation Strategy with Multi-Level Budget Constraint DQN in Real-Time Bidding
C Zhang, K Zheng, Y Tian, W Xue, T Yang, D An, Y Pi, R Chen
Neurocomputing 488, 647-656, 2022
32022
NeurIPS 2022 Competition: Driving SMARTS
A Rasouli, R Goebel, ME Taylor, I Kotseruba, S Alizadeh, T Yang, ...
arXiv preprint arXiv:2211.07545, 2022
22022
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis
Y Cao, Z Li, T Yang, H Zhang, Y Zheng, Y Li, J Hao, Y Liu
Advances in Neural Information Processing Systems 35, 19930-19943, 2022
22022
Learning to shape rewards using a game of two partners
D Mguni, T Jafferjee, J Wang, N Perez-Nieves, W Song, F Tong, M Taylor, ...
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11604 …, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20