Regime switching bandits | X Zhou, Y Xiong, N Chen, X Gao | Advances in Neural Information Processing Systems 34, 4542-4554, 2021 | 39 | 2021 |
Sublinear regret for learning POMDPs | Y Xiong, N Chen, X Gao, X Zhou | Production and Operations Management 31 (9), 3491-3504, 2022 | 24 | 2022 |
Nonparametric advertising budget allocation with inventory constraint | C Yang, Y Xiong | European Journal of Operational Research 285 (2), 631-641, 2020 | 15 | 2020 |
Debiasing samples from online learning using bootstrap | N Chen, X Gao, Y Xiong | International Conference on Artificial Intelligence and Statistics, 8514-8533, 2022 | 6 | 2022 |
No Algorithmic Collusion in Two-Player Blindfolded Game with Thompson Sampling | N Chen, X Gao, Y Xiong | arXiv preprint arXiv:2405.17463, 2024 | | 2024 |