Marc Lanctot

Citée par

	Toutes	Depuis 2019
Citations	43028	36083
indice h	43	39
indice i10	68	62

8000

4000

2000

6000

20142015201620172018201920202021202220232024117 121 916 1818 3152 4406 5337 6212 6637 7290 6182

Accès public

Tout afficher

5 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLAdresse e-mail validée de ucl.ac.uk
Karl TuylsResearch Scientist, Entrepreneur, ex-Google DeepMind, Prof at University of LiverpoolAdresse e-mail validée de hcompany.ai
David SilverDeepMind, UCLAdresse e-mail validée de google.com
Michael BowlingAmii, University of AlbertaAdresse e-mail validée de ualberta.ca
Julian SchrittwieserDeepMindAdresse e-mail validée de furidamu.org
Laurent SifreH CompanyAdresse e-mail validée de polytechnique.edu
Arthur GuezGoogle DeepMindAdresse e-mail validée de google.com
Joel Z LeiboResearch scientistAdresse e-mail validée de google.com
julien perolatDeepMindAdresse e-mail validée de google.com
Timothy P. LillicrapDirector of Research, Google DeepMindAdresse e-mail validée de google.com
Ioannis AntonoglouDeepmind, UCLAdresse e-mail validée de reflection.ai
Audrūnas GruslysAdresse e-mail validée de gruslys.com
Chris J. MaddisonUniversity of TorontoAdresse e-mail validée de cs.toronto.edu
George van den DriesscheDeepMindAdresse e-mail validée de deepmind.com
Neil BurchSony AI & Alberta Machine Intelligence Institute, University of AlbertaAdresse e-mail validée de ualberta.ca
Vinicius ZambaldiGoogle DeepmindAdresse e-mail validée de google.com
Thomas HubertGoogle DeepmindAdresse e-mail validée de google.com
Rémi MunosGoogle DeepMindAdresse e-mail validée de inria.fr
Nal KalchbrennerGoogle DeepMindAdresse e-mail validée de google.com
koray kavukcuogluDeepMindAdresse e-mail validée de kavukcuoglu.org

Suivre

Marc Lanctot

Research Scientist, Google DeepMind

Adresse e-mail validée de google.com - Page d'accueil

Artificial Intelligence Game Theory Search Multiagent Systems Reinforcement Learning


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... Nature 529 (7587), 484-489, 2016	20399	2016
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... Science 362 (6419), 1140-1144, 2018	7129*	2018
Dueling Network Architectures for Deep Reinforcement Learning Z Wang, T Schaul, M Hessel, H van Hasselt, M Lanctot, N de Freitas arXiv preprint arXiv:1511.06581, 2016	5317	2016
Value-decomposition networks for cooperative multi-agent learning based on team reward P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... Proceedings of the 17th international conference on autonomous agents and …, 2018	1920*	2018
Deep Q-learning from Demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Association for the Advancement of Artificial Intelligence (AAAI), 2018	1320	2018
Multi-agent Reinforcement Learning in Sequential Social Dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel AAMAS, 2017	927	2017
A unified game-theoretic approach to multiagent reinforcement learning M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ... arXiv preprint arXiv:1711.00832, 2017	775	2017
The hanabi challenge: A new frontier for ai research N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ... Artificial Intelligence 280, 103216, 2020	438	2020
Fictitious Self-Play in Extensive-Form Games J Heinrich, M Lanctot, D Silver International Conference on Machine Learning, 2015	407	2015
Monte Carlo sampling for regret minimization in extensive games M Lanctot, K Waugh, M Zinkevich, M Bowling Advances in neural information processing systems 22, 1078-1086, 2009	396	2009
OpenSpiel: A Framework for Reinforcement Learning in Games M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ... arXiv preprint arXiv:1908.09453, 2019	280	2019
Memory-efficient backpropagation through time A Gruslys, R Munos, I Danihelka, M Lanctot, A Graves Advances In Neural Information Processing Systems, 4125-4133, 2016	269*	2016
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	216	2022
Emergent Communication through Negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	195	2018
Actor-critic policy optimization in partially observable multiagent environments S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ... Advances in Neural Information Processing Systems, 3422-3435, 2018	172	2018
α-Rank: Multi-Agent Evaluation by Evolution S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ... Scientific reports 9 (1), 9937, 2019	143	2019
Convolution by evolution: Differentiable pattern producing networks C Fernando, D Banarse, M Reynolds, F Besse, D Pfau, M Jaderberg, ... Proceedings of the Genetic and Evolutionary Computation Conference 2016, 109-116, 2016	138	2016
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research JZ Leibo, E Hughes, M Lanctot, T Graepel arXiv preprint arXiv:1903.00742, 2019	130	2019
Real-Time Monte-Carlo Tree Search in Ms Pac-Man T Pepels, MHM Winands, M Lanctot Transactions on Computation Intelligence and AI in Games, 2014	121	2014
A Generalized Training Approach for Multiagent Learning P Muller, S Omidshafiei, M Rowland, K Tuyls, J Perolat, S Liu, D Hennes, ... arXiv preprint arXiv:1909.12823, 2019	116	2019

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs