Hanlin Zhu

Navedeno

	Vse	Od leta 2020
Navedbe	401	399
indeks h	8	8
indeks i10	8	8

220

110

165

201820192020202120222023202420251 1 25 34 24 64 206 45

Javni dostop

Prikaži vse

2 članka

0 člankov

na voljo

ni na voljo

Na podlagi zahtev v povezavi s financiranjem

Soavtorji

Jiantao JiaoAssistant Professor of EECS and Statistics, University of California, BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Banghua ZhuUniversity of California, BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Stuart RussellProfessor of Computer Science, University of California, BerkeleyPreverjeni e-poštni naslov na cs.berkeley.edu
Tianhao WuUniversity of California, BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Evan FrickUC BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Yuandong TianResearch Scientist, Meta AI (FAIR)Preverjeni e-poštni naslov na fb.com
Baihe HuangUniversity of California, BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Paria RashidinejadPostdoctoral Scholar, University of California, BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyPreverjeni e-poštni naslov na cs.berkeley.edu
Danqing WangCarnegie Mellon UniversityPreverjeni e-poštni naslov na andrew.cmu.edu
Kevin YangUC BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Xiaomeng YangGoogle DeepMindPreverjeni e-poštni naslov na google.com
Ryuichi TakanobumiHoYoPreverjeni e-poštni naslov na mihoyo.com
Minlie Huangcomputer science, Tsinghua UniversityPreverjeni e-poštni naslov na tsinghua.edu.cn
Cyrus RashtchianGoogle ResearchPreverjeni e-poštni naslov na eng.ucsd.edu
David WoodruffProfessor of Computer Science, Carnegie Mellon UniversityPreverjeni e-poštni naslov na cs.cmu.edu
Kunhe YangUniversity of California, BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Kannan RamchandranProfessor of Electrical Engineering and Computer Science, UC BerkeleyPreverjeni e-poštni naslov na eecs.berkeley.edu
Wei-Lin ChiangUC BerkeleyPreverjeni e-poštni naslov na berkeley.edu
Karthik GanesanAnyreachPreverjeni e-poštni naslov na anyreach.ai

Spremljaj

Hanlin Zhu

Ph.D. student, University of California, Berkeley

Preverjeni e-poštni naslov na berkeley.edu - Domača stran

machine learning theoretical computer science game theory


Naslov Razvrsti po navedbah Razvrsti po letniku Razvrsti po naslovu	Navedeno Navedeno	Leto
Starling-7b: Improving helpfulness and harmlessness with rlaif B Zhu, E Frick, T Wu, H Zhu, K Ganesan, WL Chiang, J Zhang, J Jiao First Conference on Language Modeling, 2024	116*	2024
Guided dialog policy learning: Reward estimation for multi-domain task-oriented dialog R Takanobu, H Zhu, M Huang Conference on Empirical Methods in Natural Language Processing, 100-110, 2019	101	2019
Optimal conservative offline rl with general function approximation via augmented lagrangian P Rashidinejad, H Zhu, K Yang, S Russell, J Jiao arXiv preprint arXiv:2211.00716, 2022	45	2022
Vector-matrix-vector queries for solving linear algebra, statistics, and graph problems C Rashtchian, DP Woodruff, H Zhu Approximation, Randomization, and Combinatorial Optimization. Algorithms and …, 2020	38	2020
Learning Personalized Alignment for Evaluating Open-ended Text Generation D Wang, K Yang, H Zhu, X Yang, A Cohen, L Li, Y Tian arXiv preprint arXiv:2310.03304, 2023	21*	2023
Importance weighted actor-critic for optimal conservative offline reinforcement learning H Zhu, P Rashidinejad, J Jiao Advances in Neural Information Processing Systems 36, 49579-49602, 2023	18	2023
Towards optimal statistical watermarking B Huang, H Zhu, B Zhu, K Ramchandran, MI Jordan, JD Lee, J Jiao arXiv preprint arXiv:2312.07930, 2023	16	2023
End-to-end story plot generator H Zhu, A Cohen, D Wang, K Yang, X Yang, J Jiao, Y Tian arXiv preprint arXiv:2310.08796, 2023	10	2023
Efficient prompt caching via embedding similarity H Zhu, B Zhu, J Jiao arXiv preprint arXiv:2402.01173, 2024	7	2024
On representation complexity of model-based and model-free reinforcement learning H Zhu, B Huang, S Russell arXiv preprint arXiv:2310.01706, 2023	7	2023
Towards a Theoretical Understanding of the'Reversal Curse'via Training Dynamics H Zhu, B Huang, S Zhang, M Jordan, J Jiao, Y Tian, SJ Russell Advances in Neural Information Processing Systems 37, 90473-90513, 2024	6	2024
Average-case communication complexity of statistical problems C Rashtchian, D Woodruff, P Ye, H Zhu Conference on Learning Theory, 3859-3886, 2021	6	2021
Provably efficient offline goal-conditioned reinforcement learning with general function approximation and single-policy concentrability H Zhu, A Zhang Advances in Neural Information Processing Systems 36, 4177-4198, 2023	5	2023
Provably efficient reinforcement learning via surprise bound H Zhu, R Wang, J Lee International Conference on Artificial Intelligence and Statistics, 4006-4032, 2023	5	2023
How Do LLMs Perform Two-Hop Reasoning in Context? T Guo, H Zhu, R Zhang, J Jiao, S Mei, MI Jordan, S Russell arXiv preprint arXiv:2502.13913, 2025		2025
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning DJ Su, H Zhu, Y Xu, J Jiao, Y Tian, Q Zheng arXiv preprint arXiv:2502.03275, 2025		2025
Avoiding Catastrophe in Online Learning by Asking for Help B Plaut, H Zhu, S Russell arXiv preprint arXiv:2402.08062, 2024		2024

Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.

Članki 1–17

Št. navedb na leto

Podvojene navedbe

Združene navedbe

Dodajanje soavtorjevSoavtorji

Spremljaj

Navedeno

Soavtorji