Segui
Kate Baumli
Kate Baumli
Research Engineer, DeepMind
Email verificata su google.com
Titolo
Citata da
Citata da
Anno
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
20702023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
6142024
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2652020
The DeepMind JAX Ecosystem
I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/deepmind 24, 25, 2020
163*2020
Human-timescale adaptation in an open-ended task space
AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ...
arXiv preprint arXiv:2301.07608, 2023
103*2023
Relative Variational Intrinsic Control
K Baumli, D Warde-Farley, S Hansen, V Mnih
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6732-6740, 2021
482021
Learning more skills through optimistic exploration
DJ Strouse, K Baumli, D Warde-Farley, V Mnih, S Hansen
arXiv preprint arXiv:2107.14226, 2021
472021
Discovering policies with domino: Diversity optimization maintaining near optimality
T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ...
arXiv preprint arXiv:2205.13521, 2022
402022
The DeepMind JAX Ecosystem, 2020
IB DeepMind, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/google-deepmind, 0
28
Training language models to self-correct via reinforcement learning
A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ...
arXiv preprint arXiv:2409.12917, 2024
152024
Vision-language models as a source of rewards
K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ...
arXiv preprint arXiv:2312.09187, 2023
142023
Self-consistent models and values
G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ...
Advances in Neural Information Processing Systems 34, 1111-1125, 2021
132021
Entropic desired dynamics for intrinsic control
S Hansen, G Desjardins, K Baumli, D Warde-Farley, N Heess, S Osindero, ...
Advances in Neural Information Processing Systems 34, 11436-11448, 2021
72021
Controlling agents using relative variational intrinsic control
DCP Warde-Farley, SS Hansen, V Mnih, KA Baumli
US Patent App. 18/025,304, 2023
2023
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–14