Kamu erişimi zorunlu olan makaleler - julien perolatDaha fazla bilgi edinin
Bir yerde sunuluyor: 8
α-Rank: Multi-Agent Evaluation by Evolution
S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ...
Scientific reports 9 (1), 9937, 2019
Zorunlu olanlar: US National Science Foundation
From poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
Zorunlu olanlar: A*Star, Singapore, National Research Foundation, Singapore
Generalizing the Wilcoxon rank-sum test for interval data
J Perolat, I Couso, K Loquin, O Strauss
International Journal of Approximate Reasoning 56, 108-121, 2015
Zorunlu olanlar: National Institute of Health and Medical Research, France
Actor-critic fictitious play in simultaneous move multistage games
J Perolat, B Piot, O Pietquin
International Conference on Artificial Intelligence and Statistics, 919-928, 2018
Zorunlu olanlar: European Commission
Scalable deep reinforcement learning algorithms for mean field games
M Lauriere, S Perrin, S Girgin, P Muller, A Jain, T Cabannes, G Piliouras, ...
International conference on machine learning, 12078-12095, 2022
Zorunlu olanlar: A*Star, Singapore, National Research Foundation, Singapore
Learning Nash equilibrium for general-sum Markov games from batch data
J Pérolat, F Strub, B Piot, O Pietquin
Artificial intelligence and statistics, 232-241, 2017
Zorunlu olanlar: European Commission
Navigating the landscape of multiplayer games
S Omidshafiei, K Tuyls, WM Czarnecki, FC Santos, M Rowland, J Connor, ...
Nature communications 11 (1), 5603, 2020
Zorunlu olanlar: Fundação para a Ciência e a Tecnologia, Portugal
Softened approximate policy iteration for Markov games
J Pérolat, B Piot, M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 1860-1868, 2016
Zorunlu olanlar: European Commission
Yayıncılık ve maddi kaynak bilgileri otomatik olarak bir bilgisayar programı tarafından belirlenmektedir