Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret Y Fei, Z Yang, Y Chen, Z Wang, Q Xie Advances in Neural Information Processing Systems 33, 22384-22395, 2020 | 77 | 2020 |
Exponential bellman equation and improved regret bounds for risk-sensitive reinforcement learning Y Fei, Z Yang, Y Chen, Z Wang Advances in neural information processing systems 34, 20436-20446, 2021 | 71 | 2021 |
Dynamic regret of policy optimization in non-stationary environments Y Fei, Z Yang, Z Wang, Q Xie Advances in Neural Information Processing Systems 33, 6743-6754, 2020 | 61 | 2020 |
Risk-sensitive reinforcement learning with function approximation: A debiasing approach Y Fei, Z Yang, Z Wang International Conference on Machine Learning, 3198-3207, 2021 | 52 | 2021 |
Hidden integrality of SDP relaxations for sub-Gaussian mixture models Y Fei, Y Chen Conference On Learning Theory, 1931-1965, 2018 | 51 | 2018 |
Exponential Error Rates of SDP for Block Models: Beyond Grothendieck’s Inequality Y Fei, Y Chen IEEE Transactions on Information Theory 65 (1), 551-571, 2018 | 45 | 2018 |
Achieving the Bayes error rate in synchronization and block models by SDP, robustly Y Fei, Y Chen IEEE Transactions on Information Theory 66 (6), 3929-3953, 2020 | 28 | 2020 |
Spectral Frank-Wolfe algorithm: Strict complementarity and linear convergence L Ding, Y Fei, Q Xu, C Yang International conference on machine learning, 2535-2544, 2020 | 20 | 2020 |
Achieving the bayes error rate in stochastic block model by sdp, robustly Y Fei, Y Chen Conference on Learning Theory, 1235-1269, 2019 | 18 | 2019 |
Cascaded gaps: Towards logarithmic regret for risk-sensitive reinforcement learning Y Fei, R Xu International Conference on Machine Learning, 6392-6417, 2022 | 14 | 2022 |
Cascaded gaps: Towards gap-dependent regret for risk-sensitive reinforcement learning Y Fei, R Xu arXiv preprint arXiv:2203.03110, 2022 | 8 | 2022 |
Hidden Integrality and Semirandom Robustness of SDP Relaxation for Sub-Gaussian Mixture Model Y Fei, Y Chen Mathematics of Operations Research 47 (3), 2464-2493, 2022 | 2 | 2022 |
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation Y Fei, Z Yang, Z Wang | 1 | 2020 |
Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement Learning Y Fei, R Xu arXiv preprint arXiv:2405.02724, 2024 | | 2024 |
Discovering Discrete Structures using SDP Relaxation: Hidden Integrality, Statistical Optimality and Semirandom Robustness Y Fei | | 2020 |
Hidden Integrality and Semi-random Robustness of SDP Y Fei, Y Chen | | |