Gregory Farquhar

Cited by

	All	Since 2019
Citations	6955	6714
h-index	14	14
i10-index	18	18

2100

1050

525

1575

2017201820192020202120222023202448 154 436 767 1184 1643 2053 626

Public access

View all

13 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
Jakob FoersterAssociate Professor, University of OxfordVerified email at eng.ox.ac.uk
Nantas NardelliStealthVerified email at arbitrarygravitas.com
Philip TorrProfessor, University of OxfordVerified email at eng.ox.ac.uk
Triantafyllos AfourasFAIR, Meta, University of OxfordVerified email at fb.com
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindVerified email at cs.ucl.ac.uk
Pushmeet KohliDeepMindVerified email at google.com

Gregory Farquhar

DeepMind

Verified email at google.com

Reinforcement Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21 (178), 1-51, 2020	2122	2020
Counterfactual multi-agent policy gradients J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2032	2018
The starcraft multi-agent challenge M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ... arXiv preprint arXiv:1902.04043, 2019	906	2019
Stabilising experience replay for deep multi-agent reinforcement learning J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ... International conference on machine learning, 1146-1155, 2017	710	2017
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, G Farquhar, B Peng, S Whiteson Advances in neural information processing systems 33, 10199-10210, 2020	314	2020
A survey of reinforcement learning informed by natural language J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ... arXiv preprint arXiv:1906.03926, 2019	282	2019
Treeqn and atreec: Differentiable tree-structured models for deep reinforcement learning G Farquhar, T Rocktäschel, M Igl, S Whiteson arXiv preprint arXiv:1710.11417, 2017	140	2017
Multi-agent common knowledge reinforcement learning C Schroeder de Witt, J Foerster, G Farquhar, P Torr, W Boehmer, ... Advances in neural information processing systems 32, 2019	111*	2019
Dice: The infinitely differentiable monte carlo estimator J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, E Xing, S Whiteson International Conference on Machine Learning, 1529-1538, 2018	92	2018
Transient non-stationarity and generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826, 2020	65	2020
Growing action spaces G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve International Conference on Machine Learning, 3040-3051, 2020	33	2020
Proper value equivalence C Grimm, A Barreto, G Farquhar, D Silver, S Singh Advances in Neural Information Processing Systems 34, 7773-7786, 2021	31	2021
The impact of non-stationarity on generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826 8, 2020	29	2020
Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning A Filos, C Lyle, Y Gal, S Levine, N Jaques, G Farquhar International Conference on Machine Learning, 3305-3317, 2021	25	2021
A baseline for any order gradient estimation in stochastic computation graphs J Mao, J Foerster, T Rocktäschel, M Al-Shedivat, G Farquhar, S Whiteson International Conference on Machine Learning, 4343-4351, 2019	12	2019
Counterfactual multi-agent policy gradients. CoRR abs/1705.08926 (2017) JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson arXiv preprint arXiv:1705.08926, 2017	11	2017
Self-consistent models and values G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 1111-1125, 2021	10	2021
Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning G Farquhar, S Whiteson, J Foerster Advances in Neural Information Processing Systems 32, 2019	10	2019
Model-value inconsistency as a signal for epistemic uncertainty A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ... arXiv preprint arXiv:2112.04153, 2021	9	2021
No DICE: An investigation of the bias-variance tradeoff in meta-gradients R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson Deep RL Workshop NeurIPS 2021, 2021	5	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors