Alex Ayoub

Citado por

	Todos	Desde 2020
Citações	427	426
Índice h	5	5
Índice i10	3	3

140

105

20202021202220232024202512 67 97 130 107 13

Acesso público

Ver todos

2 artigos

0 artigo

disponível

não disponível

Com base nas autorizações de financiamento

Coautores

Csaba SzepesvariDeepMind & University of AlbertaE-mail confirmado em cs.ualberta.ca
Shuai LiuUniversity of AlbertaE-mail confirmado em ualberta.ca
Dale SchuurmansUniversity of Alberta, Google DeepMindE-mail confirmado em cs.ualberta.ca
Matthew RegehrUniversity of WaterlooE-mail confirmado em uwaterloo.ca
David JanzUniversity of OxfordE-mail confirmado em cam.ac.uk
Johannes KirschnerSwiss Data Science Center, ETH ZurichE-mail confirmado em sdsc.ethz.ch

Seguir

Alex Ayoub

Department of Computing Science, University of Alberta

E-mail confirmado em ualberta.ca

Reinforcement Learning


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
Model-based reinforcement learning with value-targeted regression A Ayoub, Z Jia, C Szepesvari, M Wang, L Yang International Conference on Machine Learning, 463-474, 2020	349	2020
Randomized exploration in reinforcement learning with general value function approximation H Ishfaq, Q Cui, V Nguyen, A Ayoub, Z Yang, Z Wang, D Precup, L Yang International Conference on Machine Learning, 4607-4616, 2021	47	2021
An elementary proof that q-learning converges almost surely MT Regehr, A Ayoub arXiv preprint arXiv:2108.02827, 2021	11	2021
Exploration via linearly perturbed loss minimisation D Janz, S Liu, A Ayoub, C Szepesvári International Conference on Artificial Intelligence and Statistics, 721-729, 2024	9	2024
Switching the loss reduces the cost in batch reinforcement learning A Ayoub, K Wang, V Liu, S Robertson, J McInerney, D Liang, N Kallus, ... Forty-first International Conference on Machine Learning, 2024	5	2024
Managing temporal resolution in continuous value estimation: a fundamental trade-off ZV Zhang, J Kirschner, J Zhang, F Zanini, A Ayoub, M Dehghan, ... Advances in Neural Information Processing Systems 36, 62519-62548, 2023	4	2023
Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits S Liu, A Ayoub, F Sentenac, X Tan, C Szepesvári arXiv preprint arXiv:2410.01112, 2024	1	2024
Mitigating the curse of horizon in Monte-Carlo returns A Ayoub, D Szepesvari, F Zanini, B Chan, D Gupta, BC da Silva, ... Reinforcement Learning Conference, 2024	1	2024
Switching the Loss Reduces the Cost in Batch (Offline) Reinforcement Learning A Ayoub, K Wang, V Liu, S Robertson, J McInerney, D Liang, N Kallus, ... arXiv preprint arXiv:2403.05385, 2024		2024
Towards Sample Efficient Reinforcement Learning with Function Approximation A Ayoub		2021
Does weighting improve matrix factorization for recommender systems? A Ayoub, S Robertson, D Liang, H Steck, N Kallus THE WEB CONFERENCE 2025, 0
Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning E Miahi, R MacQueen, A Ayoub, A Masoumzadeh, M White Transactions on Machine Learning Research, 0

O sistema não pode executar a operação agora. Tente novamente mais tarde.

Artigos 1–12

Citações por ano

Citações duplicadas

Citações mescladas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores