Nonstationary bandit learning via predictive sampling Y Liu, B Van Roy, K Xu International Conference on Artificial Intelligence and Statistics, 6215-6244, 2023 | 24 | 2023 |
Continual learning as computationally constrained reinforcement learning S Kumar, H Marklund, A Rao, Y Zhu, HJ Jeon, Y Liu, B Van Roy arXiv preprint arXiv:2307.04345, 2023 | 20 | 2023 |
A definition of non-stationary bandits Y Liu, X Kuang, B Van Roy arXiv preprint arXiv:2302.12202, 2023 | 11 | 2023 |
Gaussian imagination in bandit learning Y Liu, AM Devraj, B Van Roy, K Xu arXiv preprint arXiv:2201.01902, 2022 | 9 | 2022 |
Non-stationary contextual bandit learning via neural predictive ensemble sampling Z Zhu, Y Liu, X Kuang, B Van Roy arXiv preprint arXiv:2310.07786, 2023 | 3 | 2023 |