A generalist agent S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 2022 | 951 | 2022 |
Imitating latent policies from observation AD Edwards, H Sahni, Y Schroecker, CL Isbell International Conference on Machine Learning (ICML 2019), 2019 | 163 | 2019 |
Genie: Generative interactive environments J Bruce, MD Dennis, A Edwards, J Parker-Holder, Y Shi, E Hughes, M Lai, ... Forty-first International Conference on Machine Learning, 2024 | 78 | 2024 |
Forward-backward reinforcement learning AD Edwards, L Downs, JC Davidson ICRA Machine Learning in Planning and Control of Robot Motion Workshop, 2018 | 51 | 2018 |
Estimating Q (s, s') with Deep Deterministic Dynamics Gradients AD Edwards, H Sahni, R Liu, J Hung, A Jain, R Wang, A Ecoffet, T Miconi, ... International Conference on Machine Learning (ICML 2020), 2020 | 23 | 2020 |
Perceptual reward functions A Edwards, C Isbell, A Takanishi IJCAI Deep Reinforcement Learning: Frontiers and Challenges Workshop, 2016 | 23 | 2016 |
Perceptual Values from Observation AD Edwards, CL Isbell ICML Self-Supervised Learning Workshop, 2019 | 19 | 2019 |
Cross-domain perceptual reward functions AD Edwards, S Sood, CL Isbell Jr The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2017 | 11 | 2017 |
Learning few-shot imitation as cultural transmission A Bhoopchand, B Brownfield, A Collister, A Dal Lago, A Edwards, ... Nature Communications 14 (1), 7536, 2023 | 10 | 2023 |
Higher order Q-learning A Edwards, WM Pottenger 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2011 | 8 | 2011 |
Learning Robust Real-Time Cultural Transmission without Human Data CGI Team, A Bhoopchand, B Brownfield, A Collister, AD Lago, A Edwards, ... arXiv preprint arXiv:2203.00715, 2022 | 6 | 2022 |
Perceptual Goal Specifications for Reinforcement Learning AD Edwards PhD thesis proposal, Georgia Institute of Technology, 2017 | 4 | 2017 |
Transferring Agent Behaviors from Videos via Motion GANs AD Edwards, CL Isbell Jr NIPS Deep Reinforcement Learning Symposium, 2017 | 3 | 2017 |
Expressing Tasks Robustly via Multiple Discount Factors A Edwards, ML Littman, CL Isbell The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2015 | 2 | 2015 |
Autoregressively generating sequences of data elements defining actions to be performed by an agent T Erez, A Novikov, E Parisotto, JW Rae, K Zolna, MMR Denil, ... US Patent App. 17/410,689, 2023 | 1 | 2023 |
Autoregressively generating sequences of data elements defining actions to be performed by an agent SE Reed, K Zolna, E Parisotto, T Erez, A Novikov, JW Rae, MMR Denil, ... US Patent App. 18/292,165, 2024 | | 2024 |
Emulation and Imitation via Perceptual Goal Specifications AD Edwards PhD thesis, Georgia Institute of Technology, 2019 | | 2019 |