Will Dabney

Cited by

	All	Since 2019
Citations	12097	11404
h-index	34	33
i10-index	50	48

Citations per year

Year	Citations
2016	33
2017	100
2018	447
2019	893
2020	1411
2021	1909
2022	2216
2023	2509
2024	2449

Public access (based on funding mandates)

4 articles available
0 articles not available

Co-authors

Rémi Munos: Google DeepMind (verified email at inria.fr)
Mark Rowland: Research Scientist, Google DeepMind (verified email at google.com)
Marc G. Bellemare: Reliant AI (verified email at reliant.ai)
Hado van Hasselt: Research Scientist, DeepMind; Honorary Professor, UCL (verified email at google.com)
Tom Schaul: Senior Staff Scientist, DeepMind (verified email at nyu.edu)
Mohammad Gheshlaghi Azar: Cohere (verified email at cohere.com)
Andre Barreto: Research Scientist, Google DeepMind (verified email at google.com)
David Silver: DeepMind, UCL (verified email at google.com)
Bilal Piot: Google DeepMind (verified email at google.com)
Nicolas Heess: DeepMind (verified email at google.com)
Philip Thomas: University of Massachusetts Amherst (verified email at cs.umass.edu)
Sridhar Mahadevan: Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst (verified email at cs.umass.edu)
Amy McGovern: University of Oklahoma (verified email at ou.edu)
Bo Liu: PhD, AAAI SM, IEEE SM (verified email at cs.umass.edu)
Georg Ostrovski: Google DeepMind

Will Dabney

DeepMind

Verified email at google.com

Reinforcement Learning, Machine Learning, Artificial Intelligence


Title	Cited by	Year
Rainbow: Combining improvements in deep reinforcement learning. M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2829	2018
A distributional perspective on reinforcement learning. MG Bellemare, W Dabney, R Munos. arXiv preprint arXiv:1707.06887, 2017	1902	2017
Distributional reinforcement learning with quantile regression. W Dabney, M Rowland, M Bellemare, R Munos. Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	896	2018
Distributed distributional deterministic policy gradients. G Barth-Maron, MW Hoffman, D Budden, W Dabney, D Horgan, D Tb, ... arXiv preprint arXiv:1804.08617, 2018	671	2018
Successor features for transfer in reinforcement learning. A Barreto, W Dabney, R Munos, JJ Hunt, T Schaul, HP van Hasselt, ... Advances in neural information processing systems 30, 2017	670	2017
Implicit quantile networks for distributional reinforcement learning. W Dabney, G Ostrovski, D Silver, R Munos. International conference on machine learning, 1096-1105, 2018	633	2018
Recurrent experience replay in distributed reinforcement learning. S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney. International conference on learning representations, 2018	581	2018
A distributional code for value in dopamine-based reinforcement learning. W Dabney, Z Kurth-Nelson, N Uchida, CK Starkweather, D Hassabis, ... Nature 577 (7792), 671-675, 2020	456	2020
The Cramer distance as a solution to biased Wasserstein gradients. MG Bellemare, I Danihelka, W Dabney, S Mohamed, ... arXiv preprint arXiv:1705.10743, 2017	428	2017
Revisiting fundamentals of experience replay. W Fedus, P Ramachandran, R Agarwal, Y Bengio, H Larochelle, ... International conference on machine learning, 3061-3071, 2020	314	2020
Deep reinforcement learning and its neuroscientific implications. M Botvinick, JX Wang, W Dabney, KJ Miller, Z Kurth-Nelson. Neuron 107 (4), 603-616, 2020	219	2020
Fast task inference with variational intrinsic successor features. S Hansen, W Dabney, A Barreto, T Van de Wiele, D Warde-Farley, V Mnih. arXiv preprint arXiv:1906.05030, 2019	181	2019
Distributional reinforcement learning. MG Bellemare, W Dabney, M Rowland. MIT Press, 2023	153	2023
An analysis of categorical distributional reinforcement learning. M Rowland, M Bellemare, W Dabney, R Munos, YW Teh. International Conference on Artificial Intelligence and Statistics, 29-37, 2018	140	2018
The Reactor: A fast and sample-efficient actor-critic agent for reinforcement learning. A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos. arXiv preprint arXiv:1704.04651, 2017	110	2017
Temporally-extended ε-greedy exploration. W Dabney, G Ostrovski, A Barreto. arXiv preprint arXiv:2006.01782, 2020	109	2020
A geometric perspective on optimal representations for reinforcement learning. M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ... Advances in neural information processing systems 32, 2019	107	2019
Understanding and preventing capacity loss in reinforcement learning. C Lyle, M Rowland, W Dabney. arXiv preprint arXiv:2204.09560, 2022	105	2022
On the expressivity of Markov reward. D Abel, W Dabney, A Harutyunyan, MK Ho, M Littman, D Precup, S Singh. Advances in Neural Information Processing Systems 34, 7799-7812, 2021	102	2021
Statistics and samples in distributional reinforcement learning. M Rowland, R Dadashi, S Kumar, R Munos, MG Bellemare, W Dabney. International Conference on Machine Learning, 5528-5536, 2019	101	2019
