‪Zuyue Fu‬ - ‪Google 學術搜尋‬

建立我自己的個人學術檔案

引用次數

	全部	自 2020 年
引文	278	276
H 指數	8	8
i10 指數	7	7

0

90

45

20192020202120222023202420252 12 37 60 71 83 13

公開取用

6 篇文章

0 篇文章

可供使用

無法使用

根據資金強制性政策

Zuyue Fu

Zuyue Fu

Northwestern University

在 u.northwestern.edu 的電子郵件地址已通過驗證 - 首頁

Reinforcement learning Machine learning Optimization


標題按引用次數排序按年份排序按標題排序	引用次數引用次數	年份
Actor-critic provably finds Nash equilibria of linear-quadratic mean-field games Z Fu, Z Yang, Y Chen, Z Wang International Conference on Learning Representations, 2019	72	2019
Instrumental variable value iteration for causal offline reinforcement learning L Liao, Z Fu, Z Yang, Y Wang, D Ma, M Kolar, Z Wang Journal of Machine Learning Research 25 (303), 1-56, 2024	52	2024
Single-timescale actor-critic provably finds globally optimal policy Z Fu, Z Yang, Z Wang International Conference on Learning Representations, 2020	51	2020
Learning from demonstration: Provably efficient adversarial policy imitation with linear function approximation Z Liu, Y Zhang, Z Fu, Z Yang, Z Wang International conference on machine learning, 14094-14138, 2022	28*	2022
Offline reinforcement learning with instrumental variables in confounded markov decision processes Z Fu, Z Qi, Z Wang, Z Yang, Y Xu, MR Kosorok arXiv preprint arXiv:2209.08666, 2022	25	2022
False correlation reduction for offline reinforcement learning Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (2), 1199-1211, 2023	18*	2023
Decentralized single-timescale actor-critic on zero-sum two-player stochastic games H Guo, Z Fu, Z Yang, Z Wang International Conference on Machine Learning, 3899-3909, 2021	11	2021
Sample elicitation J Wei, Z Fu, Y Liu, X Li, Z Yang, Z Wang International Conference on Artificial Intelligence and Statistics, 2692-2700, 2021	9	2021
Convergent reinforcement learning with function approximation: A bilevel optimization perspective Z Yang, Z Fu, K Zhang, Z Wang	7	2018
Optimistic exploration with learned features provably solves markov decision processes with neural dynamics S Zheng, L Wang, S Qiu, Z Fu, Z Yang, C Szepesvari, Z Wang The Eleventh International Conference on Learning Representations, 2022	4	2022
A two-fold structural classification method for determining the accurate ensemble of protein structures P Tan, Z Fu, L Petridis, S Qian, D You, D Wei, J Li, L Hong Communications in Computational Physics 25 (4), 2018	1	2018
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information Z Fu, Z Qi, Z Yang, Z Wang, L Wang arXiv preprint arXiv:2212.12167, 2022		2022
On the Optimality and Complexity of Reinforcement Learning Z Fu Northwestern University, 2022		2022

系統目前無法執行作業，請稍後再試。

文章 1–13