Ziyu Wang
DeepMind
Verified email at google.com
Title
Cited by
Year
Taking the human out of the loop: A review of Bayesian optimization
B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas
Proceedings of the IEEE 104 (1), 148-175, 2015
Cited by: 5751; Year: 2015
Dueling network architectures for deep reinforcement learning
Z Wang, T Schaul, M Hessel, H Hasselt, M Lanctot, N Freitas
International conference on machine learning, 1995-2003, 2016
Cited by: 5318; Year: 2016
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
Cited by: 4858; Year: 2019
Emergence of locomotion behaviours in rich environments
N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
Cited by: 1157; Year: 2017
Sample efficient actor-critic with experience replay
Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ...
arXiv preprint arXiv:1611.01224, 2016
Cited by: 1033; Year: 2016
Bayesian optimization in a billion dimensions via random embeddings
Z Wang, F Hutter, M Zoghi, D Matheson, N De Freitas
Journal of Artificial Intelligence Research 55, 361-387, 2016
Cited by: 868; Year: 2016
AlphaStar: Mastering the real-time strategy game StarCraft II
O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ...
DeepMind blog 2, 20, 2019
Cited by: 570; Year: 2019
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
Cited by: 418; Year: 2020
Reinforcement and imitation learning for diverse visuomotor skills
Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ...
arXiv preprint arXiv:1802.09564, 2018
Cited by: 379; Year: 2018
Deep fried convnets
Z Yang, M Moczulski, M Denil, N De Freitas, A Smola, L Song, Z Wang
Proceedings of the IEEE international conference on computer vision, 1476-1483, 2015
Cited by: 356; Year: 2015
Learning an embedding space for transferable robot skills
K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller
International Conference on Learning Representations, 2018
Cited by: 351; Year: 2018
Critic regularized regression
Z Wang, A Novikov, K Zolna, JS Merel, JT Springenberg, SE Reed, ...
Advances in Neural Information Processing Systems 33, 7768-7778, 2020
Cited by: 335; Year: 2020
Playing hard exploration games by watching YouTube
Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas
Advances in neural information processing systems 31, 2018
Cited by: 319; Year: 2018
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
Cited by: 265; Year: 2020
Robust imitation of diverse behaviors
Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess
Advances in Neural Information Processing Systems 30, 2017
Cited by: 250; Year: 2017
Learning human behaviors from motion capture by adversarial imitation
J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ...
arXiv preprint arXiv:1707.02201, 2017
Cited by: 245; Year: 2017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International conference on machine learning, 2912-2921, 2017
Cited by: 242; Year: 2017
RL Unplugged: A suite of benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 7248-7259, 2020
Cited by: 192; Year: 2020
Hyperparameter selection for offline reinforcement learning
TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ...
arXiv preprint arXiv:2007.09055, 2020
Cited by: 167; Year: 2020
Bayesian optimization in AlphaGo
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
Cited by: 160; Year: 2018