Følg
Yuhuai(Tony) Wu
Yuhuai(Tony) Wu
Co-Founder of xAI
Verifisert e-postadresse på x.ai - Startside
Tittel
Sitert av
Sitert av
År
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
nature 575 (7782), 350-354, 2019
5377*2019
On the opportunities and risks of foundation models
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
42252021
Openai baselines
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
1967*2017
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
14032023
Holistic evaluation of language models
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
arXiv preprint arXiv:2211.09110, 2022
10702022
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Y Wu, E Mansimov, RB Grosse, S Liao, J Ba
Advances in Neural Information Processing Systems, 5283-5292, 2017
8452017
Solving quantitative reasoning problems with language models
A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...
Advances in Neural Information Processing Systems 35, 3843-3857, 2022
6332022
STaR: Bootstrapping reasoning with reasoning
E Zelikman, Y Wu, ND Goodman
arXiv preprint arXiv:2203.14465, 2022
4682022
Backpropagation through the void: Optimizing control variates for black-box gradient estimation
W Grathwohl, D Choi, Y Wu, G Roeder, D Duvenaud
ICLR2018, 2017
3302017
Sticking the landing: Simple, lower-variance gradient estimators for variational inference
G Roeder, Y Wu, DK Duvenaud
Advances in Neural Information Processing Systems 30, 2017
291*2017
On the quantitative analysis of decoder-based generative models
Y Wu, Y Burda, R Salakhutdinov, R Grosse
5th International Conference on Learning Representations (ICLR 2017), 2016
2762016
Solving olympiad geometry without human demonstrations
TH Trinh, Y Wu, QV Le, H He, T Luong
Nature 625 (7995), 476-482, 2024
2532024
Memorizing Transformers
Y Wu, MN Rabe, DL Hutchins, C Szegedy
International Conference on Learning Representations 2022, 2022
2452022
Architectural complexity measures of recurrent neural networks
S Zhang, Y Wu, T Che, Z Lin, R Memisevic, RR Salakhutdinov, Y Bengio
Advances in neural information processing systems 29, 2016
1952016
STDP-compatible approximation of backpropagation in an energy-based model
Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu
Neural computation 29 (3), 555-577, 2017
189*2017
On multiplicative integration with recurrent neural networks
Y Wu, S Zhang, Y Zhang, Y Bengio, RR Salakhutdinov
Advances in neural information processing systems 29, 2016
1852016
Exploring length generalization in large language models
C Anil, Y Wu, A Andreassen, A Lewkowycz, V Misra, V Ramasesh, ...
Advances in Neural Information Processing Systems 35, 38546-38556, 2022
1772022
The Importance of Sampling in Meta-Reinforcement Learning
B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ...
Advances in Neural Information Processing Systems, 9299-9309, 2018
177*2018
Invariant Causal Representation Learning for Out-of-Distribution Generalization
C Lu, Y Wu, JM Hernández-Lobato, B Schölkopf
International Conference on Learning Representations, 2022
151*2022
Autoformalization with large language models
Y Wu, AQ Jiang, W Li, M Rabe, C Staats, M Jamnik, C Szegedy
Advances in Neural Information Processing Systems 35, 32353-32368, 2022
1492022
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–20