Seguir
Aidan Gomez
Aidan Gomez
Cohere
Email confirmado em cohere.ai - Página inicial
Título
Citado por
Citado por
Ano
Attention is all you need (arXiv: 1706.03762). arXiv
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
173001*2017
Tensor2tensor for neural machine translation
A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...
arXiv preprint arXiv:1803.07416, 2018
6742018
The reversible residual network: Backpropagation without storing activations
AN Gomez, M Ren, R Urtasun, RB Grosse
Advances in neural information processing systems 30, 2017
6382017
Disease variant prediction with deep generative models of evolutionary data
J Frazer, P Notin, M Dias, A Gomez, JK Min, K Brock, Y Gal, DS Marks
Nature 599 (7883), 91-95, 2021
6092021
Depthwise Separable Convolutions for Neural Machine Translation
L Kaiser, AN Gomez, F Chollet
International Conference on Learning Representations, 2018
4072018
One model to learn them all
L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ...
arXiv preprint arXiv:1706.05137, 2017
4072017
Attention is all you need, 2023
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2023
319*2023
Attention is all you need. arXiv. org
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2017
247*2017
Tranception: protein fitness prediction with autoregressive transformers and inference-time retrieval
P Notin, M Dias, J Frazer, J Marchena-Hurtado, AN Gomez, D Marks, ...
International Conference on Machine Learning, 16990-17017, 2022
2172022
Attention is all you need
A Waswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, A Gomez, ...
NIPS, 2017
2092017
& Polosukhin, I.(2017)
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Attention is all you need. In: Advances in neural information processing …, 2017
1892017
Attention is all you need. arXiv 2023
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2023
1742023
Prioritized training on points that are learnable, worth learning, and not yet learnt
S Mindermann, JM Brauner, MT Razzak, M Sharma, A Kirsch, W Xu, ...
International Conference on Machine Learning, 15630-15649, 2022
1602022
A systematic comparison of bayesian deep learning robustness in diabetic retinopathy tasks
A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ...
arXiv preprint arXiv:1912.10481, 2019
155*2019
Self-attention between datapoints: Going beyond individual input-output pairs in deep learning
J Kossen, N Band, C Lyle, AN Gomez, T Rainforth, Y Gal
Advances in Neural Information Processing Systems 34, 28742-28756, 2021
1482021
Learning Sparse Networks Using Targeted Dropout
AN Gomez, I Zhang, S Rao Kamalakara, D Madaan, K Swersky, Y Gal, ...
arXiv preprint arXiv:1905.13678, 2019
1292019
The difficulty of training sparse neural networks
U Evci, F Pedregosa, A Gomez, E Elsen
arXiv preprint arXiv:1906.10732, 2019
1122019
Unsupervised cipher cracking using discrete GANs
AN Gomez, S Huang, I Zhang, BM Li, M Osama, L Kaiser
arXiv preprint arXiv:1801.04883, 2018
842018
Attention is all you need (Version 7). arXiv
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2017
812017
Attention is all You need, August 2023
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 0
77
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20