Actor-critic provably finds Nash equilibria of linear-quadratic mean-field games Z Fu, Z Yang, Y Chen, Z Wang International Conference on Learning Representations, 2019 | 68 | 2019 |
Instrumental variable value iteration for causal offline reinforcement learning L Liao, Z Fu, Z Yang, Y Wang, M Kolar, Z Wang arXiv preprint arXiv:2102.09907, 2021 | 47 | 2021 |
Single-timescale actor-critic provably finds globally optimal policy Z Fu, Z Yang, Z Wang International Conference on Learning Representations, 2020 | 44 | 2020 |
Learning from demonstration: Provably efficient adversarial policy imitation with linear function approximation Z Liu, Y Zhang, Z Fu, Z Yang, Z Wang International Conference on Machine Learning, 14094-14138, 2022 | 25* | 2022 |
Offline reinforcement learning with instrumental variables in confounded Markov decision processes Z Fu, Z Qi, Z Wang, Z Yang, Y Xu, MR Kosorok arXiv preprint arXiv:2209.08666, 2022 | 21 | 2022 |
False correlation reduction for offline reinforcement learning Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 16* | 2023 |
Decentralized single-timescale actor-critic on zero-sum two-player stochastic games H Guo, Z Fu, Z Yang, Z Wang International Conference on Machine Learning, 3899-3909, 2021 | 11 | 2021 |
Sample elicitation J Wei, Z Fu, Y Liu, X Li, Z Yang, Z Wang International Conference on Artificial Intelligence and Statistics, 2692-2700, 2021 | 9 | 2021 |
Convergent reinforcement learning with function approximation: A bilevel optimization perspective Z Yang, Z Fu, K Zhang, Z Wang | 7 | 2018 |
Optimistic exploration with learned features provably solves Markov decision processes with neural dynamics S Zheng, L Wang, S Qiu, Z Fu, Z Yang, C Szepesvari, Z Wang The Eleventh International Conference on Learning Representations, 2022 | 3 | 2022 |
A two-fold structural classification method for determining the accurate ensemble of protein structures P Tan, Z Fu, L Petridis, S Qian, D You, D Wei, J Li, L Hong Communications in Computational Physics 25 (4), 2018 | 1 | 2018 |
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information Z Fu, Z Qi, Z Yang, Z Wang, L Wang arXiv preprint arXiv:2212.12167, 2022 | | 2022 |
On the Optimality and Complexity of Reinforcement Learning Z Fu Northwestern University, 2022 | | 2022 |