Follow
David Dohan
David Dohan
Google Brain
Verified email at google.com
Title
Cited by
Cited by
Year
Gpt-4 technical report
J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
arXiv preprint arXiv:2303.08774, 2023
92352023
Palm: Scaling language modeling with pathways
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
Journal of Machine Learning Research 24 (240), 1-113, 2023
58042023
GPT-4 technical report
S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
arXiv.[https://arxiv. org/abs/2303.08774](Zugriff: 17.06. 2024), 2024
21022024
Unsupervised pixel-level domain adaptation with generative adversarial networks
K Bousmalis, N Silberman, D Dohan, D Erhan, D Krishnan
Proceedings of the IEEE conference on computer vision and pattern …, 2017
19912017
Rethinking attention with performers
K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...
International Conference on Learning Representations, 2021
18732021
Program synthesis with large language models
J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ...
arXiv preprint arXiv:2108.07732, 2021
15282021
Qanet: Combining local convolution with global self-attention for reading comprehension
AW Yu, D Dohan, MT Luong, R Zhao, K Chen, M Norouzi, QV Le
International Conference on Learning Representations, 2018
1401*2018
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
13922022
Solving quantitative reasoning problems with language models
A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...
Advances in Neural Information Processing Systems 35, 3843-3857, 2022
7482022
Show your work: Scratchpads for intermediate computation with language models
M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ...
6672021
Large language models can be easily distracted by irrelevant context
F Shi, X Chen, K Misra, N Scales, D Dohan, EH Chi, N Schärli, D Zhou
International Conference on Machine Learning, 31210-31227, 2023
4152023
Model-based reinforcement learning for biological sequence design
C Angermueller, D Dohan, D Belanger, R Deshpande, K Murphy, ...
International conference on learning representations, 2019
1582019
Palm: Scaling language modeling with pathways, 2022
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311, 2022
1372022
Palm: Scaling language modeling with pathways. arXiv 2022
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311 10, 1, 2022
1252022
Program synthesis with large language models. CoRR abs/2108.07732 (2021)
J Austin, A Odena, MI Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ...
arXiv preprint arXiv:2108.07732, 2021
1182021
Masked language modeling for proteins via linearly scalable long-context transformers
K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...
arXiv preprint arXiv:2006.03555, 2020
1122020
Gpt-4 technical report, 2024
S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...
URL https://arxiv. org/abs/2303.08774 2, 6, 2024
1112024
Openai o1 system card
A Jaech, A Kalai, A Lerer, A Richardson, A El-Kishky, A Low, A Helyar, ...
arXiv preprint arXiv:2412.16720, 2024
1042024
Chi, Nathanael Schärli, and Denny Zhou. 2023. Large language models can be easily distracted by irrelevant context
F Shi, X Chen, K Misra, N Scales, D Dohan
arXiv preprint arXiv:2302.00093 12, 28, 2023
952023
Evoprompting: Language models for code-level neural architecture search
A Chen, D Dohan, D So
Advances in neural information processing systems 36, 7787-7817, 2023
912023
The system can't perform the operation now. Try again later.
Articles 1–20