Gregory Farquhar
Gregory Farquhar
DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
7182018
Qmix: Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, C Schroeder, G Farquhar, J Foerster, S Whiteson
International Conference on Machine Learning, 4295-4304, 2018
4362018
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
International conference on machine learning, 1146-1155, 2017
3702017
The starcraft multi-agent challenge
M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
arXiv preprint arXiv:1902.04043, 2019
1492019
Treeqn and atreec: Differentiable tree-structured models for deep reinforcement learning
G Farquhar, T Rocktäschel, M Igl, S Whiteson
arXiv preprint arXiv:1710.11417, 2017
952017
A survey of reinforcement learning informed by natural language
J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ...
arXiv preprint arXiv:1906.03926, 2019
702019
Dice: The infinitely differentiable monte carlo estimator
J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, E Xing, S Whiteson
International Conference on Machine Learning, 1529-1538, 2018
402018
Multi-agent common knowledge reinforcement learning
C Schroeder de Witt, J Foerster, G Farquhar, P Torr, W Boehmer, ...
Advances in Neural Information Processing Systems 32, 9927-9939, 2019
272019
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, G Farquhar, B Peng, S Whiteson
arXiv preprint arXiv:2006.10800, 2020
112020
Weighted qmix: Expanding monotonic value function factorisation
T Rashid, G Farquhar, B Peng, S Whiteson
arXiv e-prints, arXiv: 2006.10800, 2020
112020
Counterfactual multi-agent policy gradients. CoRR abs/1705.08926 (2017)
JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
arXiv preprint arXiv:1705.08926, 2017
82017
The impact of non-stationarity on generalisation in deep reinforcement learning
M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson
arXiv e-prints, arXiv: 2006.05826, 2020
72020
A baseline for any order gradient estimation in stochastic computation graphs
J Mao, J Foerster, T Rocktäschel, M Al-Shedivat, G Farquhar, S Whiteson
International Conference on Machine Learning, 4343-4351, 2019
62019
Growing action spaces
G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve
International Conference on Machine Learning, 3040-3051, 2020
52020
Loaded DiCE: trading off bias and variance in any-order score function estimators for reinforcement learning
G Farquhar, S Whiteson, J Foerster
arXiv preprint arXiv:1909.10549, 2019
42019
Transient non-stationarity and generalisation in deep reinforcement learning
M Igl, G Farquhar, J Luketina, W Böhmer, S Whiteson
arXiv preprint arXiv:2006.05826, 2020
12020
Multi-agent common knowledge reinforcement learning
CA Schroeder, J Foerster, G Farquhar, P Torr, W Boehmer, S Whiteson
12019
Proper Value Equivalence
C Grimm, A Barreto, G Farquhar, D Silver, S Singh
arXiv preprint arXiv:2106.10316, 2021
2021
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
A Filos, C Lyle, Y Gal, S Levine, N Jaques, G Farquhar
arXiv preprint arXiv:2102.12560, 2021
2021
35TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING, ICML 2018
T Rashid, CS De Witt, G Farquhar, J Foerster, S Whiteson, M Samvelyan, ...
2018
The system can't perform the operation now. Try again later.
Articles 1–20