Zhaoran Wang

引用次數

	全部	自 2019 年
引文	9428	8773
H 指數	49	48
i10 指數	114	112

2700

1350

675

2025

2014201520162017201820192020202120222023202433 66 134 198 202 305 659 1268 1707 2189 2629

公開取用

查看全部

53 篇文章

0 篇文章

可供使用

無法使用

根據資金強制性政策

追蹤

Zhaoran Wang

Associate Professor at Northwestern University

在 northwestern.edu 的電子郵件地址已通過驗證 - 首頁

Deep Reinforcement Learning Data-Driven Decision-Making Optimization Under Uncertainty Nonconvex


標題按引用次數排序按年份排序按標題排序	引用次數引用次數	年份
Provably Efficient Reinforcement Learning with Linear Function Approximation C Jin, Z Yang, Z Wang, MI Jordan Mathematics of Operations Research/Annual Conference on Learning Theory, 2023	855	2023
A Theoretical Analysis of Deep Q-Learning J Fan, Z Wang, Y Xie, Z Yang Learning for Dynamics and Control, 2020	821	2020
Is Pessimism Provably Efficient for Offline RL? Y Jin, Z Yang, Z Wang Mathematics of Operations Research/International Conference on Machine Learning, 2024	430	2024
Provably Efficient Exploration in Policy Optimization Q Cai, Z Yang, C Jin, Z Wang International Conference on Machine Learning, 2020	316	2020
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic M Hong, HT Wai, Z Wang, Z Yang SIAM Journal on Optimization, 2022	295*	2022
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence L Wang, Q Cai, Z Yang, Z Wang International Conference on Learning Representations, 2020	265	2020
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems, 2019	228*	2019
Optimal Computational and Statistical Rates of Convergence for Sparse Nonconvex Learning Problems Z Wang, H Liu, T Zhang Annals of Statistics, 2014	209	2014
Multi-Agent Reinforcement Learning via Double-Averaging Primal-Dual Optimization HT Wai, Z Yang, Z Wang, M Hong Advances in Neural Information Processing Systems, 2018	207	2018
A Strictly Contractive Peaceman--Rachford Splitting Method for Convex Programming B He, H Liu, Z Wang, X Yuan SIAM Journal on Optimization, 2014	197	2014
A Nonconvex Optimization Framework for Low Rank Matrix Estimation T Zhao, Z Wang, H Liu Advances in Neural Information Processing Systems, 2015	195*	2015
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization D Ding, X Wei, Z Yang, Z Wang, MR Jovanović International Conference on Artificial Intelligence and Statistics, 2021	182	2021
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium Q Xie, Y Chen, Z Wang, Z Yang Mathematics of Operations Research/Annual Conference on Learning Theory, 2022	162	2022
Neural Temporal-Difference and Q-Learning Provably Converge to Global Optima Q Cai, Z Yang, JD Lee, Z Wang Mathematics of Operations Research/Advances in Neural Information Processing …, 2019	155*	2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost Z Yang, Y Chen, M Hong, Z Wang Advances in Neural Information Processing Systems, 2019	153	2019
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning C Bai, L Wang, Z Yang, Z Deng, A Garg, P Liu, Z Wang International Conference on Learning Representations, 2022	150	2022
Bridging Exploration and General Function Approximation in Reinforcement Learning: Provably Efficient Kernel and Neural Value Iterations Z Yang, C Jin, Z Wang, M Wang, MI Jordan Advances in Neural Information Processing Systems, 2020	149*	2020
A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum P Khanduri, S Zeng, M Hong, HT Wai, Z Wang, Z Yang Advances in Neural Information Processing Systems, 2021	144	2021
High-Dimensional Expectation-Maximization Algorithm: Statistical Optimization and Asymptotic Normality Z Wang, Q Gu, Y Ning, H Liu Advances in Neural Information Processing Systems, 2015	143	2015
Convergent Policy Optimization for Safe Reinforcement Learning M Yu, Z Yang, M Kolar, Z Wang Advances in Neural Information Processing Systems, 2019	131	2019

系統目前無法執行作業，請稍後再試。

文章 1–20

每年的引文數

重複引用

合併引文

新增共同作者共同作者

追蹤

引用次數