Alex Ayoub
Department of Computing Science, University of Alberta
Verified email at ualberta.ca
Title
Cited by
Year
Model-based reinforcement learning with value-targeted regression
A Ayoub, Z Jia, C Szepesvari, M Wang, L Yang
International Conference on Machine Learning, 463-474, 2020
Cited by 349, 2020
Randomized exploration in reinforcement learning with general value function approximation
H Ishfaq, Q Cui, V Nguyen, A Ayoub, Z Yang, Z Wang, D Precup, L Yang
International Conference on Machine Learning, 4607-4616, 2021
Cited by 47, 2021
An elementary proof that Q-learning converges almost surely
MT Regehr, A Ayoub
arXiv preprint arXiv:2108.02827, 2021
Cited by 11, 2021
Exploration via linearly perturbed loss minimisation
D Janz, S Liu, A Ayoub, C Szepesvári
International Conference on Artificial Intelligence and Statistics, 721-729, 2024
Cited by 9, 2024
Switching the loss reduces the cost in batch reinforcement learning
A Ayoub, K Wang, V Liu, S Robertson, J McInerney, D Liang, N Kallus, ...
Forty-first International Conference on Machine Learning, 2024
Cited by 5, 2024
Managing temporal resolution in continuous value estimation: a fundamental trade-off
ZV Zhang, J Kirschner, J Zhang, F Zanini, A Ayoub, M Dehghan, ...
Advances in Neural Information Processing Systems 36, 62519-62548, 2023
Cited by 4, 2023
Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits
S Liu, A Ayoub, F Sentenac, X Tan, C Szepesvári
arXiv preprint arXiv:2410.01112, 2024
Cited by 1, 2024
Mitigating the curse of horizon in Monte-Carlo returns
A Ayoub, D Szepesvari, F Zanini, B Chan, D Gupta, BC da Silva, ...
Reinforcement Learning Conference, 2024
Cited by 1, 2024
Switching the Loss Reduces the Cost in Batch (Offline) Reinforcement Learning
A Ayoub, K Wang, V Liu, S Robertson, J McInerney, D Liang, N Kallus, ...
arXiv preprint arXiv:2403.05385, 2024
2024
Towards Sample Efficient Reinforcement Learning with Function Approximation
A Ayoub
2021
Does weighting improve matrix factorization for recommender systems?
A Ayoub, S Robertson, D Liang, H Steck, N Kallus
The Web Conference 2025
Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning
E Miahi, R MacQueen, A Ayoub, A Masoumzadeh, M White
Transactions on Machine Learning Research
Articles 1–12