Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2070 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 614 | 2024 |
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 265 | 2020 |
The DeepMind JAX Ecosystem I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/deepmind 24, 25, 2020 | 163* | 2020 |
Human-timescale adaptation in an open-ended task space AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ... arXiv preprint arXiv:2301.07608, 2023 | 103* | 2023 |
Relative Variational Intrinsic Control K Baumli, D Warde-Farley, S Hansen, V Mnih Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6732-6740, 2021 | 48 | 2021 |
Learning more skills through optimistic exploration DJ Strouse, K Baumli, D Warde-Farley, V Mnih, S Hansen arXiv preprint arXiv:2107.14226, 2021 | 47 | 2021 |
Discovering policies with domino: Diversity optimization maintaining near optimality T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ... arXiv preprint arXiv:2205.13521, 2022 | 40 | 2022 |
The DeepMind JAX Ecosystem, 2020 IB DeepMind, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/google-deepmind, 0 | 28 | |
Training language models to self-correct via reinforcement learning A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ... arXiv preprint arXiv:2409.12917, 2024 | 15 | 2024 |
Vision-language models as a source of rewards K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ... arXiv preprint arXiv:2312.09187, 2023 | 14 | 2023 |
Self-consistent models and values G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 1111-1125, 2021 | 13 | 2021 |
Entropic desired dynamics for intrinsic control S Hansen, G Desjardins, K Baumli, D Warde-Farley, N Heess, S Osindero, ... Advances in Neural Information Processing Systems 34, 11436-11448, 2021 | 7 | 2021 |
Controlling agents using relative variational intrinsic control DCP Warde-Farley, SS Hansen, V Mnih, KA Baumli US Patent App. 18/025,304, 2023 | | 2023 |