Shixiang Shane Gu

Cited by

	All	Since 2019
Citations	21673	19987
h-index	42	41
i10-index	60	60

7000

3500

1750

5250

201620172018201920202021202220232024124 449 1031 1477 2107 2783 3335 6413 3798

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Sergey LevineUC Berkeley, Physical IntelligenceVerified email at eecs.berkeley.edu
Richard E TurnerProfessor, University of CambridgeVerified email at cam.ac.uk
Zoubin GhahramaniProfessor, University of Cambridge, and Distinguished Researcher, GoogleVerified email at eng.cam.ac.uk
Ilya SutskeverCo-Founder and Chief Scientist of OpenAIVerified email at openai.com
Andriy MnihResearch Scientist at Google DeepMindVerified email at cs.toronto.edu
Hong GeCambridge UniversityVerified email at cam.ac.uk
Steve MannProfessor of Electrical and Computer Engineering, University of TorontoVerified email at eecg.utoronto.ca

Shixiang Shane Gu

Google DeepMind

Verified email at google.com - Homepage

Deep Learning Artificial Intelligence Machine Learning Reinforcement Learning Robotics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Categorical reparameterization with gumbel-softmax E Jang, S Gu, B Poole arXiv preprint arXiv:1611.01144, 2016	5618	2016
Large language models are zero-shot reasoners T Kojima, SS Gu, M Reid, Y Matsuo, Y Iwasawa Advances in neural information processing systems 35, 22199-22213, 2022	1838	2022
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates S Gu, E Holly, T Lillicrap, S Levine 2017 IEEE international conference on robotics and automation (ICRA), 3389-3396, 2017	1819	2017
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024	1528	2024
Continuous deep q-learning with model-based acceleration S Gu, T Lillicrap, I Sutskever, S Levine International conference on machine learning, 2829-2838, 2016	1212	2016
Continuous deep q-learning with model-based acceleration S Gu, T Lillicrap, I Sutskever, S Levine International conference on machine learning, 2829-2838, 2016	1212	2016
Towards deep neural network architectures robust to adversarial examples S Gu, L Rigazio arXiv preprint arXiv:1412.5068, 2014	987	2014
Data-efficient hierarchical reinforcement learning O Nachum, SS Gu, H Lee, S Levine Advances in neural information processing systems 31, 2018	901	2018
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023	731	2023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022	727	2022
A minimalist approach to offline reinforcement learning S Fujimoto, SS Gu Advances in neural information processing systems 34, 20132-20145, 2021	537	2021
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	427	2023
Dynamics-aware unsupervised discovery of skills A Sharma, S Gu, S Levine, V Kumar, K Hausman arXiv preprint arXiv:1907.01657, 2019	405	2019
Q-prop: Sample-efficient policy gradient with an off-policy critic S Gu, T Lillicrap, Z Ghahramani, RE Turner, S Levine arXiv preprint arXiv:1611.02247, 2016	389	2016
Human-centric dialog training via offline reinforcement learning N Jaques, JH Shen, A Ghandeharioun, C Ferguson, A Lapedriza, ... arXiv preprint arXiv:2010.05848, 2020	362*	2020
Temporal difference models: Model-free deep rl for model-based control V Pong, S Gu, M Dalal, S Levine arXiv preprint arXiv:1802.09081, 2018	280	2018
A divergence minimization perspective on imitation learning methods SKS Ghasemipour, R Zemel, S Gu Conference on robot learning, 1259-1277, 2020	254	2020
Sequence tutor: Conservative fine-tuning of sequence generation models with kl-control N Jaques, S Gu, D Bahdanau, JM Hernández-Lobato, RE Turner, D Eck International Conference on Machine Learning, 1645-1654, 2017	250*	2017
Large language models can self-improve J Huang, SS Gu, L Hou, Y Wu, X Wang, H Yu, J Han arXiv preprint arXiv:2210.11610, 2022	248	2022
Near-optimal representation learning for hierarchical reinforcement learning O Nachum, S Gu, H Lee, S Levine arXiv preprint arXiv:1810.01257, 2018	218	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors