Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1400 | 2023 |
Towards a human-like open-domain chatbot D Adiwardana, MT Luong, DR So, J Hall, N Fiedel, R Thoppilan, Z Yang, ... arXiv preprint arXiv:2001.09977, 2020 | 1126 | 2020 |
Carbon emissions and large neural network training D Patterson, J Gonzalez, Q Le, C Liang, LM Munguia, D Rothchild, D So, ... arXiv preprint arXiv:2104.10350, 2021 | 802 | 2021 |
Pay attention to mlps H Liu, Z Dai, D So, QV Le Advances in neural information processing systems 34, 9204-9215, 2021 | 630 | 2021 |
The evolved transformer D So, Q Le, C Liang International conference on machine learning, 5877-5886, 2019 | 550 | 2019 |
Automl-zero: Evolving machine learning algorithms from scratch E Real, C Liang, D So, Q Le International conference on machine learning, 8007-8019, 2020 | 336 | 2020 |
The carbon footprint of machine learning training will plateau, then shrink D Patterson, J Gonzalez, U Hölzle, Q Le, C Liang, LM Munguia, ... Computer 55 (7), 18-28, 2022 | 318 | 2022 |
Searching for efficient transformers for language modeling D So, W Mańke, H Liu, Z Dai, N Shazeer, QV Le Advances in neural information processing systems 34, 6010-6022, 2021 | 161 | 2021 |
Classification of crystallization outcomes using deep convolutional neural networks AE Bruno, P Charbonneau, J Newman, EH Snell, DR So, V Vanhoucke, ... PLOS one 13 (6), e0198883, 2018 | 86 | 2018 |
Transcending scaling laws with 0.1% extra compute Y Tay, J Wei, HW Chung, VQ Tran, DR So, S Shakeri, X Garcia, HS Zheng, ... arXiv preprint arXiv:2210.11399, 2022 | 67 | 2022 |
EvoPrompting: language models for code-level neural architecture search A Chen, D Dohan, D So Advances in Neural Information Processing Systems 36, 2024 | 61 | 2024 |
Mufasa: Multimodal fusion architecture search for electronic health records Z Xu, DR So, AM Dai Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10532 …, 2021 | 59 | 2021 |
Towards a human-like open-domain chatbot. arXiv 2020 D Adiwardana, MT Luong, DR So, J Hall, N Fiedel, R Thoppilan, Z Yang, ... arXiv preprint arXiv:2001.09977, 2001 | 42 | 2001 |
Brainformers: Trading simplicity for efficiency Y Zhou, N Du, Y Huang, D Peng, C Lan, D Huang, S Shakeri, D So, ... International Conference on Machine Learning, 42531-42542, 2023 | 21 | 2023 |
Computationally efficient neural network architecture search DM Dohan, DR So, C Liang, QV Le US Patent 10,997,503, 2021 | 17 | 2021 |
Towards a human-like open-domain chatbot A Kulshreshtha, DDF Adiwardana, DR So, G Nemade, J Hall, N Fiedel, ... arXiv preprint arXiv:2001.09977, 2020 | 9 | 2020 |
Improving image generative models with human interactions AK Lampinen, D So, D Eck, F Bertsch arXiv preprint arXiv:1709.10459, 2017 | 5 | 2017 |
Unified functional hashing in automatic machine learning R Gillard, S Jonany, Y Miao, M Munn, C de Souza, J Dungay, C Liang, ... arXiv preprint arXiv:2302.05433, 2023 | 2 | 2023 |
Multi-modal neural network architecture search Z Xu, DR So, AM Dai US Patent App. 17/915,796, 2023 | 1 | 2023 |
Granular neural network architecture search over low-level primitives DR So, QV Le Jr, H Liu, WA Manke, Z Dai, NM Shazeer US Patent App. 17/827,362, 2022 | 1 | 2022 |