Iason Gabriel

Citado por

	Todos	Desde 2019
Citações	5040	5007
Índice h	19	19
Índice i10	26	26

2700

1350

675

2025

20192020202120222023202427 36 119 518 1638 2617

Acesso público

Ver todos

1 artigo

0 artigo

disponível

não disponível

Com base nas autorizações de financiamento

Coautores

William S. IsaacPrincipal Scientist (Director), Google DeepMindE-mail confirmado em google.com
Laura WeidingerStaff Research Scientist at DeepMindE-mail confirmado em google.com
Lisa Anne M HendricksDeepMindE-mail confirmado em google.com
Maribeth RauhResearch Engineer, DeepMindE-mail confirmado em deepmind.com
abeba birhaneAdjunct assistant professor at the school of computer science and statistics, Trinity College DublinE-mail confirmado em tcd.ie
Atoosa KasirzadehGoogle & Carnegie Mellon UniversityE-mail confirmado em google.com
Will HawkinsDeepMindE-mail confirmado em deepmind.com
John F J MellorDeepMindE-mail confirmado em deepmind.com
Zachary KentonGoogle DeepMindE-mail confirmado em google.com
Geoffrey IrvingUK AI Safety Institute (AISI)E-mail confirmado em naml.us
Po-Sen HuangResearch Scientist, DeepMindE-mail confirmado em google.com
Amelia GlaeseGoogle DeepMindE-mail confirmado em deepmind.com
Borja BalleDeepMindE-mail confirmado em google.com
Myra ChengStanfordE-mail confirmado em stanford.edu
Laura RimellDeepMindE-mail confirmado em google.com
Julia HaasSenior Research Scientist, DeepMindE-mail confirmado em google.com
Sasha Blake BrownGoogle DeepMindE-mail confirmado em deepmind.com
Jonathan UesatoE-mail confirmado em mit.edu
Vinodkumar PrabhakaranStaff Research Scientist, Google LLCE-mail confirmado em google.com
Shakir MohamedResearch Director, Google DeepMindE-mail confirmado em deepmind.com

Seguir

Iason Gabriel

Senior Staff Research Scientist, Google DeepMind

E-mail confirmado em google.com

Political Theory Moral Philosophy Philosophy of AI Global Justice Human Rights


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021	1050	2021
Ethical and social risks of harm from language models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359, 2021	943	2021
Artificial intelligence, values, and alignment I Gabriel Minds and machines 30 (3), 411-437, 2020	701	2020
Taxonomy of risks posed by language models L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	547	2022
Improving alignment of dialogue agents via targeted human judgements A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ... arXiv preprint arXiv:2209.14375, 2022	452	2022
Power to the people? Opportunities and challenges for participatory AI A Birhane, W Isaac, V Prabhakaran, M Diaz, MC Elish, I Gabriel, ... Proceedings of the 2nd ACM Conference on Equity and Access in Algorithms …, 2022	213	2022
Effective altruism and its critics I Gabriel Journal of Applied Philosophy 34 (4), 457-473, 2017	158	2017
Alignment of language agents Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving arXiv preprint arXiv:2103.14659, 2021	155	2021
Model evaluation for extreme risks T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... arXiv preprint arXiv:2305.15324, 2023	132	2023
In conversation with artificial intelligence: aligning language models with human values A Kasirzadeh, I Gabriel Philosophy & Technology 36 (2), 27, 2023	116	2023
Sociotechnical safety evaluation of generative ai systems L Weidinger, M Rauh, N Marchal, A Manzini, LA Hendricks, ... arXiv preprint arXiv:2310.11986, 2023	103	2023
Toward a theory of justice for artificial intelligence I Gabriel Daedalus 151 (2), 218-231, 2022	76	2022
The Challenge of Value Alignment I Gabriel, V Ghazavi The Oxford Handbook of Digital Ethics, 2022	56*	2022
A human rights-based approach to responsible AI V Prabhakaran, M Mitchell, T Gebru, I Gabriel arXiv preprint arXiv:2210.02667, 2022	50	2022
Characteristics of harmful text: Towards rigorous benchmarking of language models M Rauh, J Mellor, J Uesato, PS Huang, J Welbl, L Weidinger, S Dathathri, ... Advances in Neural Information Processing Systems 35, 24720-24739, 2022	46	2022
Using the Veil of Ignorance to align AI systems with principles of justice L Weidinger, KR McKee, R Everett, S Huang, TO Zhu, MJ Chadwick, ... Proceedings of the National Academy of Sciences 120 (18), e2213709120, 2023	31	2023
Beyond privacy trade-offs with structured transparency A Trask, E Bluemke, T Collins, BGE Drexler, CG Cuervas-Mons, I Gabriel, ... arXiv preprint arXiv:2012.08347, 2020	30	2020
The ethics of advanced ai assistants I Gabriel, A Manzini, G Keeling, LA Hendricks, V Rieser, H Iqbal, ... arXiv preprint arXiv:2404.16244, 2024	28	2024
Permissible secrets H Lazenby, I Gabriel The Philosophical Quarterly 68 (271), 265-285, 2018	22	2018
Representation in AI evaluations AS Bergman, LA Hendricks, M Rauh, B Wu, W Agnew, M Kunesch, I Duan, ... Proceedings of the 2023 ACM Conference on Fairness, Accountability, and …, 2023	18	2023

O sistema não pode executar a operação agora. Tente novamente mais tarde.

Artigos 1–20

Citações por ano

Citações duplicadas

Citações mescladas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores