GPT-4 Technical Report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 7033* | 2023 |
Go-explore: a new approach for hard-exploration problems A Ecoffet, J Huizinga, J Lehman, KO Stanley, J Clune arXiv preprint arXiv:1901.10995, 2019 | 451 | 2019 |
First return, then explore A Ecoffet, J Huizinga, J Lehman, KO Stanley, J Clune Nature 590 (7847), 580-586, 2021 | 406 | 2021 |
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos B Baker, I Akkaya, P Zhokhov, J Huizinga, J Tang, A Ecoffet, B Houghton, ... arXiv preprint arXiv:2206.11795, 2022 | 265 | 2022 |
Weak-to-strong generalization: Eliciting strong capabilities with weak supervision C Burns, P Izmailov, JH Kirchner, B Baker, L Gao, L Aschenbrenner, ... arXiv preprint arXiv:2312.09390, 2023 | 164 | 2023 |
Reinforcement learning under moral uncertainty A Ecoffet, J Lehman International conference on machine learning, 2926-2936, 2021 | 39 | 2021 |
Exploration based language learning for text-based games A Madotto, M Namazifar, J Huizinga, P Molino, A Ecoffet, H Zheng, ... International Joint Conferences on Artificial Intelligence, 1488-1494, 2020 | 33 | 2020 |
Estimating Q(s, s’) with deep deterministic dynamics gradients A Edwards, H Sahni, R Liu, J Hung, A Jain, R Wang, A Ecoffet, T Miconi, ... International Conference on Machine Learning, 2825-2835, 2020 | 23 | 2020 |
Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft I Kanitscheider, J Huizinga, D Farhi, WH Guss, B Houghton, R Sampedro, ... arXiv preprint arXiv:2106.14876, 2021 | 20 | 2021 |
Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity A Ecoffet, J Clune, J Lehman Artificial Life Conference Proceedings, 27-35, 2020 | 17 | 2020 |
Montezuma’s revenge solved by go-explore, a new algorithm for hard-exploration problems (sets records on pitfall, too) A Ecoffet, J Huizinga, J Lehman, KO Stanley, J Clune Uber Engineering Blog, 2018 | 15 | 2018 |
Deep reinforcement learning based models for hard-exploration problems JM Clune, AL Ecoffet, KO Stanley, J Huizinga, JA Lehman US Patent 11,829,870, 2023 | 1 | 2023 |
Using machine learning to train and use a model to perform automatic interface actions based on video and input datasets B Baker, I Akkaya, P Zhokhov, J Huizinga, J Tang, A Ecoffet, B Houghton, ... US Patent 11,887,367, 2024 | | 2024 |