Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...
International conference on machine learning, 507-517, 2020
750 2020 Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney
International conference on learning representations, 2018
607 2018 Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
390 2020 The DeepMind JAX Ecosystem, 2020 I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/deepmind 18, 2010
116 2010 Making efficient use of demonstrations to solve hard exploration problems TL Paine, C Gulcehre, B Shahriari, M Denil, M Hoffman, H Soyer, ...
arXiv preprint arXiv:1909.01387, 2019
101 2019 The DeepMind JAX Ecosystem I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
GitHub http://github. com/deepmind, 2020
75 2020 Human-level atari 200x faster S Kapturowski, V Campos, R Jiang, N Rakićević, H van Hasselt, ...
arXiv preprint arXiv:2209.07550, 2022
45 2022 The DeepMind JAX Ecosystem, 2020 IB DeepMind, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/google-deepmind, 0
35 Beyond fine-tuning: Transferring behavior in reinforcement learning V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ...
arXiv preprint arXiv:2102.13515, 2021
30 2021 The DeepMind JAX Ecosystem IB DeepMind, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/google-deepmind, 2020
29 2020 Revisiting Peng’s Q( ) for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ...
International Conference on Machine Learning, 5794-5804, 2021
25 2021 Value-driven hindsight modelling A Guez, F Viola, T Weber, L Buesing, S Kapturowski, D Precup, D Silver, ...
Advances in Neural Information Processing Systems 33, 12499-12509, 2020
22 2020 Transformers need glasses! information over-squashing in language tasks F Barbero, A Banino, S Kapturowski, D Kumaran, J Madeira Araújo, ...
Advances in Neural Information Processing Systems 37, 98111-98142, 2024
16 2024 Temporal difference uncertainties as a signal for exploration S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ...
arXiv preprint arXiv:2010.02255, 2020
16 2020 Offline actor-critic reinforcement learning scales to large models JT Springenberg, A Abdolmaleki, J Zhang, O Groth, M Bloesch, T Lampe, ...
arXiv preprint arXiv:2402.05546, 2024
12 2024 Coverage as a principle for discovering transferable behavior in reinforcement learning V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ...
9 2021 RLax: Reinforcement Learning in JAX, 2020 D Budden, M Hessel, J Quan, S Kapturowski, K Baumli, S Bhupatiraju, ...
URL http://github. com/deepmind/rlax, 0
8 Jointly learning exploratory and non-exploratory action selection policies AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ...
US Patent 11,714,990, 2023
6 2023 Unlocking the power of representations in long-term novelty-based exploration A Saade, S Kapturowski, D Calandriello, C Blundell, P Sprechmann, ...
arXiv preprint arXiv:2305.01521, 2023
6 2023 Never give up: Learning directed exploration strategies A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ...
arXiv e-prints, arXiv: 2002.06038, 2020
6 2020