追蹤
Zuyue Fu
Zuyue Fu
在 u.northwestern.edu 的電子郵件地址已通過驗證
標題
引用次數
引用次數
年份
Actor-critic provably finds Nash equilibria of linear-quadratic mean-field games
Z Fu, Z Yang, Y Chen, Z Wang
International Conference on Learning Representations, 2019
682019
Instrumental variable value iteration for causal offline reinforcement learning
L Liao, Z Fu, Z Yang, Y Wang, M Kolar, Z Wang
arXiv preprint arXiv:2102.09907, 2021
472021
Single-timescale actor-critic provably finds globally optimal policy
Z Fu, Z Yang, Z Wang
International Conference on Learning Representations, 2020
442020
Learning from demonstration: Provably efficient adversarial policy imitation with linear function approximation
Z Liu, Y Zhang, Z Fu, Z Yang, Z Wang
International conference on machine learning, 14094-14138, 2022
25*2022
Offline reinforcement learning with instrumental variables in confounded markov decision processes
Z Fu, Z Qi, Z Wang, Z Yang, Y Xu, MR Kosorok
arXiv preprint arXiv:2209.08666, 2022
212022
False correlation reduction for offline reinforcement learning
Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
16*2023
Decentralized single-timescale actor-critic on zero-sum two-player stochastic games
H Guo, Z Fu, Z Yang, Z Wang
International Conference on Machine Learning, 3899-3909, 2021
112021
Sample elicitation
J Wei, Z Fu, Y Liu, X Li, Z Yang, Z Wang
International Conference on Artificial Intelligence and Statistics, 2692-2700, 2021
92021
Convergent reinforcement learning with function approximation: A bilevel optimization perspective
Z Yang, Z Fu, K Zhang, Z Wang
72018
Optimistic exploration with learned features provably solves markov decision processes with neural dynamics
S Zheng, L Wang, S Qiu, Z Fu, Z Yang, C Szepesvari, Z Wang
The Eleventh International Conference on Learning Representations, 2022
32022
A two-fold structural classification method for determining the accurate ensemble of protein structures
P Tan, Z Fu, L Petridis, S Qian, D You, D Wei, J Li, L Hong
Communications in Computational Physics 25 (4), 2018
12018
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
Z Fu, Z Qi, Z Yang, Z Wang, L Wang
arXiv preprint arXiv:2212.12167, 2022
2022
On the Optimality and Complexity of Reinforcement Learning
Z Fu
Northwestern University, 2022
2022
系統目前無法執行作業,請稍後再試。
文章 1–13