Publicly accessible articles - Rémi Munos
Available somewhere: 21
A distributional code for value in dopamine-based reinforcement learning
W Dabney, Z Kurth-Nelson, N Uchida, CK Starkweather, D Hassabis, ...
Nature 577 (7792), 671-675, 2020
Mandates: US National Institutes of Health
Relative upper confidence bound for the k-armed dueling bandit problem
M Zoghi, S Whiteson, R Munos, M Rijke
International Conference on Machine Learning, 10-18, 2014
Mandates: Royal Netherlands Academy of Arts and Sciences
α-Rank: Multi-Agent Evaluation by Evolution
S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ...
Scientific reports 9 (1), 9937, 2019
Mandates: US National Science Foundation
From Poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
Mandates: A*Star, Singapore, National Research Foundation, Singapore
Regret bounds for restless Markov bandits
R Ortner, D Ryabko, P Auer, R Munos
International conference on algorithmic learning theory, 214-228, 2012
Mandates: Austrian Science Fund
Relative confidence sampling for efficient on-line ranker evaluation
M Zoghi, SA Whiteson, M De Rijke, R Munos
Proceedings of the 7th ACM international conference on Web search and data …, 2014
Mandates: Royal Netherlands Academy of Arts and Sciences
Generalized emphatic temporal difference learning: Bias-variance analysis
A Hallak, A Tamar, R Munos, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016
Mandates: European Commission
Regret bounds for restless Markov bandits
R Ortner, D Ryabko, P Auer, R Munos
Theoretical Computer Science 558, 62-76, 2014
Mandates: Austrian Science Fund
Navigating the landscape of multiplayer games
S Omidshafiei, K Tuyls, WM Czarnecki, FC Santos, M Rowland, J Connor, ...
Nature communications 11 (1), 5603, 2020
Mandates: Fundação para a Ciência e a Tecnologia, Portugal
Learning in two-player zero-sum partially observable Markov games with perfect recall
T Kozuno, P Ménard, R Munos, M Valko
Advances in Neural Information Processing Systems 34, 11987-11998, 2021
Mandates: Natural Sciences and Engineering Research Council of Canada, Science …
Fast LSTD using stochastic approximation: Finite time analysis and application to traffic control
LA Prashanth, N Korda, R Munos
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014
Mandates: UK Engineering and Physical Sciences Research Council
Fast rates for maximum entropy exploration
D Tiapkin, D Belomestny, D Calandriello, E Moulines, R Munos, ...
International Conference on Machine Learning, 34161-34221, 2023
Mandates: German Research Foundation, Agence Nationale de la Recherche
Planning in entropy-regularized Markov decision processes and games
JB Grill, O Darwiche Domingues, P Ménard, R Munos, M Valko
Advances in Neural Information Processing Systems 32, 2019
Mandates: European Commission, Agence Nationale de la Recherche
Adapting to game trees in zero-sum imperfect information games
C Fiegel, P Ménard, T Kozuno, R Munos, V Perchet, M Valko
International Conference on Machine Learning, 10093-10135, 2023
Mandates: Agence Nationale de la Recherche
Optimistic posterior sampling for reinforcement learning with few samples and tight guarantees
D Tiapkin, D Belomestny, D Calandriello, E Moulines, R Munos, ...
Advances in Neural Information Processing Systems 35, 10737-10751, 2022
Mandates: German Research Foundation, Agence Nationale de la Recherche
Spectral bandits
T Kocák, R Munos, B Kveton, S Agrawal, M Valko
Journal of Machine Learning Research 21 (218), 1-44, 2020
Mandates: European Commission, Agence Nationale de la Recherche
Fast gradient descent for drifting least squares regression, with application to bandits
N Korda, LA Prashanth, R Munos
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
Mandates: UK Engineering and Physical Sciences Research Council
Optimistic optimization of a Brownian
JB Grill, M Valko, R Munos
Advances in Neural Information Processing Systems 31, 2018
Mandates: European Commission
Model-free posterior sampling via learning rate randomization
D Tiapkin, D Belomestny, D Calandriello, E Moulines, R Munos, ...
Advances in Neural Information Processing Systems 36, 73719-73774, 2023
Mandates: Agence Nationale de la Recherche
Local and adaptive mirror descents in extensive-form games
C Fiegel, P Ménard, T Kozuno, R Munos, V Perchet, M Valko
Advances in Neural Information Processing Systems 37, 56703-56737, 2024
Mandates: European Commission, Agence Nationale de la Recherche