Suivre
David Budden
David Budden
Google DeepMind
Adresse e-mail validée de csail.mit.edu - Page d'accueil
Titre
Citée par
Citée par
Année
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
48702019
Scaling Language Models: Methods, Analysis & Insights from Training Gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
1205*2021
DeepMind Control Suite
Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ...
arXiv preprint arXiv:1801.00690, 2018
1150*2018
Distributed Prioritized Experience Replay
D Horgan, J Quan, D Budden, G Barth-Maron, M Hessel, H van Hasselt, ...
International Conference on Learning Representations (ICLR), 2018
9422018
Distributed Distributional Deterministic Policy Gradients
G Barth-Maron, MW Hoffman, D Budden, W Dabney, D Horgan, A Muldal, ...
International Conference on Learning Representations (ICLR), 2018
6742018
AlphaStar: Mastering the Real-Time Strategy Game StarCraft II
O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, W Czarnecki, ...
https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game …, 2019
5702019
The DeepMind JAX Ecosystem
I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
http://github.com/deepmind, 2020
390*2020
Playing hard exploration games by watching YouTube
Y Aytar, T Pfaff, D Budden, TL Paine, Z Wang, N de Freitas
Advances in Neural Information Processing Systems (NeurIPS), 2018
3192018
Generative compression
S Santurkar, D Budden, N Shavit
arXiv preprint arXiv:1703.01467, 2017
2582017
Sample Efficient Adaptive Text-to-Speech
Y Chen, Y Assael, B Shillingford, D Budden, S Reed, H Zen, Q Wang, ...
International Conference on Learning Representations (ICLR), 2019
180*2019
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, S Gómez Colmenarejo, A Novikov, K Konyushkova, S Reed, ...
Robotics: Science and Systems (RSS), 2020
175*2020
Observe and Look Further: Achieving Consistent Performance on Atari
T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...
arXiv preprint arXiv:1805.11593, 2018
1502018
Unified Scaling Laws for Routed Language Models
A Clark, D Casas, A Guy, A Mensch, M Paganini, J Hoffmann, B Damoc, ...
International Conference on Machine Learning (ICML), 2022
149*2022
A Global Social Media Survey of Attitudes to Human Genome Editing
T McCaughey, PG Sanfilippo, GEC Gooden, DM Budden, L Fan, ...
Cell Stem Cell 18 (5), 569-572, 2016
1092016
The CLRS Algorithmic Reasoning Benchmark
P Veličković, AP Badia, D Budden, R Pascanu, A Banino, M Dashevskiy, ...
International Conference on Machine Learning (ICML), 2022
962022
Optax: composable gradient transformation and optimisation
M Hessel, D Budden, F Viola, M Rosca, E Sezener, T Hennigan
JAX, http://github. com/deepmind/optax, 2020
87*2020
A Generalized Framework for Population Based Training
A Li, O Spyra, S Perel, V Dalibard, M Jaderberg, C Gu, D Budden, ...
International Conference on Knowledge Discovery & Data Mining (KDD), 2019
792019
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
S De, SL Smith, A Fernando, A Botev, G Cristian-Muraru, A Gu, R Haroun, ...
arXiv preprint arXiv:2402.19427, 2024
72*2024
Task-Relevant Adversarial Imitation Learning
K Zolna, S Reed, A Novikov, SG Colmenarej, D Budden, S Cabi, M Denil, ...
Conference on Robotic Learning (CoRL), 2020
642020
Modular Meta-Learning with Shrinkage
Y Chen, AL Friesen, F Behbahani, D Budden, MW Hoffman, A Doucet, ...
Advances in Neural Information Processing Systems (NeurIPS), 2020
472020
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20