Follow
Julian Schrittwieser
Julian Schrittwieser
DeepMind
Verified email at furidamu.org - Homepage
Title
Cited by
Cited by
Year
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
nature 529 (7587), 484-489, 2016
199302016
Mastering the game of go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
nature 550 (7676), 354-359, 2017
113572017
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
47432018
Mastering atari, go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
Nature 588 (7839), 604-609, 2020
24782020
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
23182017
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
15832023
Starcraft ii: A new challenge for reinforcement learning
O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ...
arXiv preprint arXiv:1708.04782, 2017
10882017
Competition-level code generation with alphacode
Y Li, D Choi, J Chung, N Kushman, J Schrittwieser, R Leblond, T Eccles, ...
Science 378 (6624), 1092-1097, 2022
8852022
Deepmind lab
C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ...
arXiv preprint arXiv:1612.03801, 2016
6152016
Discovering faster matrix multiplication algorithms with reinforcement learning
A Fawzi, M Balog, A Huang, T Hubert, B Romera-Paredes, M Barekatain, ...
Nature 610 (7930), 47-53, 2022
5632022
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
3952024
Cyprien de Masson d’Autume, Igor Babuschkin, Xinyun Chen, Po-Sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J
Y Li, D Choi, J Chung, N Kushman, J Schrittwieser, R Leblond, T Eccles, ...
Science 378 (6624), 1092-1097, 2022
2732022
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
2712019
Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv 2017
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
1602017
Bayesian optimization in alphago
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
1572018
Gemini: A family of highly capable multimodal models
R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805 1, 2023
1502023
Faster sorting algorithms discovered using deep reinforcement learning
DJ Mankowitz, A Michi, A Zhernov, M Gelmi, M Selvi, C Paduraru, ...
Nature 618 (7964), 257-263, 2023
1422023
Online and offline reinforcement learning by planning with a learned model
J Schrittwieser, T Hubert, A Mandhane, M Barekatain, I Antonoglou, ...
Advances in Neural Information Processing Systems 34, 27580-27591, 2021
1172021
Learning and planning in complex action spaces
T Hubert, J Schrittwieser, I Antonoglou, M Barekatain, S Schmitt, D Silver
International Conference on Machine Learning, 4476-4486, 2021
792021
Policy improvement by planning with Gumbel
I Danihelka, A Guez, J Schrittwieser, D Silver
International Conference on Learning Representations, 2022
612022
The system can't perform the operation now. Try again later.
Articles 1–20