Wenhao Zhan

Cited by

	All	Since 2019
Citations	408	408
h-index	10	10
i10-index	10	10

Citations per year: 2021: 11; 2022: 43; 2023: 123; 2024: 231

Public access

4 articles available, 0 articles not available (based on funding mandates)

Co-authors

Jason D. Lee, Associate Professor of Electrical Engineering and Computer Science, Princeton University (verified email at princeton.edu)
Wen Sun, Assistant Professor, Cornell University (verified email at cornell.edu)
Masatoshi Uehara, Genentech (verified email at gene.com)
Baihe Huang, University of California, Berkeley (verified email at berkeley.edu)
Yuxin Chen, University of Pennsylvania (verified email at wharton.upenn.edu)
Audrey Huang, University of Illinois Urbana-Champaign (verified email at illinois.edu)
Yuejie Chi, Carnegie Mellon University (verified email at cmu.edu)
Kianté Brantley, Assistant Professor, Harvard University (verified email at g.harvard.edu)
Jonathan D. Chang, Research Scientist, Databricks Mosaic (verified email at cornell.edu)
Nan Jiang, Associate Professor of Computer Science, UIUC (verified email at illinois.edu)
Nathan Kallus, Cornell University (verified email at cornell.edu)
Shicong Cen, PhD Candidate, Carnegie Mellon University (verified email at andrew.cmu.edu)
Owen Oertell, Undergraduate, Cornell University (verified email at cornell.edu)
Zhaolin Gao, Cornell University (verified email at cornell.edu)
Gokul Swamy, PhD Candidate, Carnegie Mellon University (verified email at andrew.cmu.edu)
Dipendra Misra, Staff Research Scientist, Mosaic Team, Databricks (verified email at databricks.com)
Thorsten Joachims, Professor of Computer Science, Cornell University (verified email at cs.cornell.edu)
J. Andrew Bagnell, Carnegie Mellon University (verified email at ri.cmu.edu)
Gen Li, Statistics, The Chinese University of Hong Kong (verified email at cuhk.edu.hk)
Simon Shaolei Du, Assistant Professor, School of Computer Science and Engineering, University of Washington (verified email at cs.washington.edu)

Wenhao Zhan

Graduate Student, Princeton University

Verified email at princeton.edu

reinforcement learning theory statistics


Title	Cited by	Year
Offline reinforcement learning with realizability and single-policy concentrability. W Zhan, B Huang, A Huang, N Jiang, J Lee. Conference on Learning Theory, 2730-2775, 2022	125	2022
Policy mirror descent for regularized reinforcement learning: A generalized framework with linear convergence. W Zhan, S Cen, B Huang, Y Chen, JD Lee, Y Chi. SIAM Journal on Optimization 33 (2), 1061-1091, 2023	85	2023
Provable Offline Preference-Based Reinforcement Learning. W Zhan, M Uehara, N Kallus, JD Lee, W Sun. The Twelfth International Conference on Learning Representations, 2024	51*	2024
PAC reinforcement learning for predictive state representations. W Zhan, M Uehara, W Sun, JD Lee. The Eleventh International Conference on Learning Representations, 2023	46	2023
Provable Reward-Agnostic Preference-Based Reinforcement Learning. W Zhan, M Uehara, W Sun, JD Lee. The Twelfth International Conference on Learning Representations, 2024	27*	2024
Dataset Reset Policy Optimization for RLHF. JD Chang, W Zhan, O Oertell, K Brantley, D Misra, JD Lee, W Sun. arXiv preprint arXiv:2404.08495, 2024	19	2024
REBEL: Reinforcement Learning via Regressing Relative Rewards. Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ... arXiv preprint arXiv:2404.16767, 2024	14	2024
Decentralized optimistic hyperpolicy mirror descent: Provably no-regret learning in Markov games. W Zhan, JD Lee, Z Yang. The Eleventh International Conference on Learning Representations, 2023	13	2023
Reward-agnostic fine-tuning: Provable statistical benefits of hybrid reinforcement learning. G Li, W Zhan, JD Lee, Y Chi, Y Chen. Advances in Neural Information Processing Systems 36, 2023	11	2023
Optimal multi-distribution learning. Z Zhang, W Zhan, Y Chen, SS Du, JD Lee. The Thirty Seventh Annual Conference on Learning Theory, 5220-5223, 2024	10	2024
Provably Efficient CVaR RL in Low-rank MDPs. Y Zhao, W Zhan, X Hu, H Leung, F Farnia, W Sun, JD Lee. The Twelfth International Conference on Learning Representations, 2024	3	2024
Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization. A Huang, W Zhan, T Xie, JD Lee, W Sun, A Krishnamurthy, DJ Foster. arXiv preprint arXiv:2407.13399, 2024	2*	2024
Over-the-Air Statistical Estimation of Sparse Models. CZ Lee, LP Barnes, W Zhan, A Özgür. 2021 IEEE Global Communications Conference (GLOBECOM), 1-6, 2021	1	2021
Delay Optimal Cross-Layer Scheduling Over Markov Channels with Power Constraint. W Zhan, H Tang, J Wang. 2020 IEEE International Symposium on Broadband Multimedia Systems and …, 2020	1	2020
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF. Z Gao, W Zhan, JD Chang, G Swamy, K Brantley, JD Lee, W Sun. arXiv preprint arXiv:2410.04612, 2024		2024
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank. W Zhan, S Fujimoto, Z Zhu, JD Lee, DR Jiang, Y Efroni. arXiv preprint arXiv:2410.01101, 2024		2024


