Runzhe Wu

Cited by

	All	Since 2020
Citations	186	185
h-index	7	7
i10-index	6	6

Citations per year: 2022: 12, 2023: 36, 2024: 110, 2025: 26

Public access

6 articles available, 0 articles not available (based on funding mandates)

Co-authors

Wen Sun, Assistant Professor, Cornell University (verified email at cornell.edu)
Ayush Sekhari, Postdoctoral Associate, MIT (verified email at mit.edu)
Karthik Sridharan, Cornell University, University of Pennsylvania, Toyota Technological Institute (verified email at ttic.edu)
Weinan Zhang, Professor, Shanghai Jiao Tong University (verified email at sjtu.edu.cn)
Yong Yu (俞勇), Professor, Shanghai Jiao Tong University (verified email at sjtu.edu.cn)
Kaiwen Wang, Computer Science PhD at Cornell Tech (verified email at cornell.edu)
Zhaoran Wang, Associate Professor at Northwestern University (verified email at northwestern.edu)
Masatoshi Uehara, EvolutionaryScale (verified email at evolutionaryscale.ai)
Akshay Krishnamurthy, University of Massachusetts Amherst (verified email at cs.umass.edu)
Kianté Brantley, Assistant Professor, Harvard University (verified email at g.harvard.edu)
Gokul Swamy, PhD Candidate, Carnegie Mellon University (verified email at andrew.cmu.edu)
Yiding Chen, Cornell University (verified email at cornell.edu)
Zhuoran Yang, Yale University (verified email at yale.edu)


Runzhe Wu

Cornell University

Verified email at cornell.edu

reinforcement learning, machine learning


Title	Authors	Venue	Cited by	Year
MALib: A parallel framework for population-based multi-agent reinforcement learning	M Zhou, Z Wan, H Wang, M Wen, R Wu, Y Wen, Y Yang, Y Yu, J Wang, ...	Journal of Machine Learning Research 24 (150), 1-12, 2023	62	2023
Making RL with preference-based feedback efficient via randomization	R Wu, W Sun	arXiv preprint arXiv:2310.14554, 2023	26	2023
The benefits of being distributional: Small-loss bounds for reinforcement learning	K Wang, K Zhou, R Wu, N Kallus, W Sun	Advances in Neural Information Processing Systems 36, 2275-2312, 2023	22	2023
Contextual bandits and imitation learning with preference-based active queries	A Sekhari, K Sridharan, W Sun, R Wu	Advances in Neural Information Processing Systems 36, 11261-11295, 2023	20	2023
Offline constrained multi-objective reinforcement learning via pessimistic dual value iteration	R Wu, Y Zhang, Z Yang, Z Wang	Advances in Neural Information Processing Systems 34, 25439-25451, 2021	20	2021
Distributional offline policy evaluation with predictive error guarantees	R Wu, M Uehara, W Sun	International Conference on Machine Learning, 37685-37712, 2023	19	2023
Selective sampling and imitation learning via online regression	A Sekhari, K Sridharan, W Sun, R Wu	Advances in Neural Information Processing Systems 36, 67213-67268, 2023	9	2023
Computationally efficient RL under linear Bellman completeness for deterministic dynamics	R Wu, A Sekhari, A Krishnamurthy, W Sun	arXiv preprint arXiv:2406.11810, 2024	5	2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning	R Wu, Y Chen, G Swamy, K Brantley, W Sun	arXiv preprint arXiv:2410.13855, 2024	3	2024
