Attention is all you need A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Advances in neural information processing systems 30, 2017 | 170872 | 2017 |
Natural questions: a benchmark for question answering research T Kwiatkowski, J Palomaki, O Redfield, M Collins, A Parikh, C Alberti, ... Transactions of the Association for Computational Linguistics 7, 453-466, 2019 | 3163 | 2019 |
Prottrans: Toward understanding the language of life through self-supervised learning A Elnaggar, M Heinzinger, C Dallago, G Rehawi, Y Wang, L Jones, ... IEEE transactions on pattern analysis and machine intelligence 44 (10), 7112 …, 2021 | 1876 | 2021 |
Advances in neural information processing systems 30 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Curran Associates Inc, 2017 | 1054 | 2017 |
L. u. Kaiser, and I. Polosukhin,“Attention is all you need,” A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez Advances in neural information processing systems 30, 5998-6008, 2017 | 919* | 2017 |
Gomez Aidan N., Kaiser Łukasz, Polosukhin Illia, Attention is all you need V Ashish, S Noam, P Niki, U Jakob, J Llion Adv. Neural Inf. Process. Syst 30, 1-11, 2017 | 792 | 2017 |
Tensor2tensor for neural machine translation A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ... arXiv preprint arXiv:1803.07416, 2018 | 674 | 2018 |
The best of both worlds: Combining recent advances in neural machine translation MX Chen, O Firat, A Bapna, M Johnson, W Macherey, G Foster, L Jones, ... arXiv preprint arXiv:1804.09849, 2018 | 546 | 2018 |
Character-level language modeling with deeper self-attention R Al-Rfou, D Choe, N Constant, M Guo, L Jones Proceedings of the AAAI conference on artificial intelligence 33 (01), 3159-3166, 2019 | 494 | 2019 |
One model to learn them all L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ... arXiv preprint arXiv:1706.05137, 2017 | 407 | 2017 |
Attention is all you need. CoRR abs/1706.03762 (2017) A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... | 347 | 2017 |
Attention is all you need, 2023 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... arXiv preprint arXiv:1706.03762, 2023 | 321* | 2023 |
A race-track trapped-ion quantum processor SA Moses, CH Baldwin, MS Allman, R Ancona, L Ascarrunz, C Barnes, ... Physical Review X 13 (4), 041052, 2023 | 280 | 2023 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 212 | 2019 |
& Polosukhin, I.(2017) A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Attention is all you need. In: Advances in neural information processing …, 2017 | 190 | 2017 |
Attention is all you need. arXiv 2023 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... arXiv preprint arXiv:1706.03762, 2023 | 177 | 2023 |
Wikireading: A novel large-scale language understanding task over wikipedia D Hewlett, A Lacoste, L Jones, I Polosukhin, A Fandrianto, J Han, ... arXiv preprint arXiv:1608.03542, 2016 | 172 | 2016 |
& Polosukhin A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... I.,“Attention is all you need,” In Advances in neural information processing …, 2017 | 151 | 2017 |
Attention is all you need. 2017. doi: 10.48550 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... arXiv preprint ARXIV.1706.03762 2 (2), 2017 | 136 | 2017 |
Attention is all you need. arXiv, 2017. doi: 10.48550 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... arXiv preprint arXiv.1706.03762 1706, 2017 | 124 | 2017 |