Stanislav Fort

Cited by

	All	Since 2019
Citations	5947	5915
h-index	22	22
i10-index	26	26

Citations per year: 2019: 36, 2020: 155, 2021: 337, 2022: 615, 2023: 1690, 2024: 3052

Public access (based on funding mandates): 5 articles available, 0 articles not available

Co-authors

Balaji Lakshminarayanan - Senior Staff Research Scientist at Google DeepMind - Verified email at google.com
Surya Ganguli - Associate Professor, Stanford University - Verified email at stanford.edu
Clara Huiyi Hu - Google DeepMind - Verified email at google.com
Stanisław Jastrzębski - Chief Technology Officer & Chief Scientist @ Molecule.One - Verified email at molecule.one
Jie Ren - Research Scientist at Google Brain - Verified email at google.com
Jeremiah Zhe Liu - Google Research and Harvard University - Verified email at mail.harvard.edu
Dustin Tran - Research Scientist, Google - Verified email at google.com
Daniel M. Roy - Research Director, Vector Institute; Prof., U. Toronto (Statistics, CS) - Verified email at utoronto.ca
Gintare Karolina Dziugaite - Google DeepMind - Verified email at google.com
Srini Narayanan - UC Berkeley and Google DeepMind - Verified email at icsi.berkeley.edu
Hui Khoon Ng - Assoc Prof, Yale-NUS College, and Centre for Quantum Technologies, National University of Singapore - Verified email at nus.edu.sg
Yihui Quek - Massachusetts Institute of Technology - Verified email at mit.edu
Dan Wilkins - Research Scientist, Stanford University - Verified email at stanford.edu
Jared Kaplan - Johns Hopkins University & Anthropic - Verified email at pha.jhu.edu
Christopher Olah - Anthropic - Verified email at google.com

Stanislav Fort

Google DeepMind

Verified email at stanford.edu

machine learning, artificial intelligence, AI safety


Title	Authors	Venue	Cited by	Year
Training a helpful and harmless assistant with reinforcement learning from human feedback	Y Bai, A Jones, K Ndousse, A Askell, A Chen, N DasSarma, D Drain, S Fort, D Ganguli, T Henighan, et al.	arXiv preprint arXiv:2204.05862, 2022	1433*	2022
Constitutional AI: Harmlessness from AI Feedback	Y Bai, S Kadavath, S Kundu, A Askell, J Kernion, A Jones, A Chen, ...	arXiv preprint arXiv:2212.08073, 2022	1110	2022
Deep Ensembles: A Loss Landscape Perspective	S Fort, H Hu, B Lakshminarayanan	arXiv preprint arXiv:1912.02757, 2019	682	2019
Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned	D Ganguli, L Lovitt, J Kernion, A Askell, Y Bai, S Kadavath, B Mann, ...	arXiv preprint arXiv:2209.07858, 2022	400	2022
Exploring the limits of out-of-distribution detection	S Fort, J Ren, B Lakshminarayanan	Advances in Neural Information Processing Systems 34, 7068-7081, 2021	351	2021
Predictability and surprise in large generative models	D Ganguli, D Hernandez, L Lovitt, A Askell, Y Bai, A Chen, T Conerly, ...	Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	275	2022
Training independent subnetworks for robust prediction	M Havasi, R Jenatton, S Fort, JZ Liu, J Snoek, B Lakshminarayanan, ...	arXiv preprint arXiv:2010.06610, 2020	227	2020
A Simple Fix to Mahalanobis Distance for Improving Near-OOD Detection	J Ren, S Fort, J Liu, AG Roy, S Padhy, B Lakshminarayanan	arXiv preprint arXiv:2106.09022, 2021	201	2021
Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the neural tangent kernel	S Fort, GK Dziugaite, M Paul, S Kharaghani, DM Roy, S Ganguli	Advances in Neural Information Processing Systems 33, 5850-5861, 2020	186	2020
The Break-Even Point on Optimization Trajectories of Deep Neural Networks	S Jastrzebski, M Szymczak, S Fort, D Arpit, J Tabor, K Cho, K Geras	arXiv preprint arXiv:2002.09572, 2020	177	2020
Language models (mostly) know what they know	S Kadavath, T Conerly, A Askell, T Henighan, D Drain, E Perez, ...	arXiv preprint arXiv:2207.05221, 2022	142	2022
Gaussian Prototypical Networks for Few-Shot Learning on Omniglot	S Fort	arXiv preprint arXiv:1708.02735, 2017	102	2017
Large Scale Structure of Neural Network Loss Landscapes	S Fort, S Jastrzebski	arXiv preprint arXiv:1906.04724, 2019	91	2019
Stiffness: A new perspective on generalization in neural networks	S Fort, PK Nowak, S Jastrzebski, S Narayanan	arXiv preprint arXiv:1901.09491, 2019	88	2019
Measuring progress on scalable oversight for large language models	SR Bowman, J Hyun, E Perez, E Chen, C Pettit, S Heiner, K Lukošiūtė, ...	arXiv preprint arXiv:2211.03540, 2022	86	2022
Adaptive quantum state tomography with neural networks	Y Quek, S Fort, HK Ng	arXiv preprint arXiv:1812.06693, 2018	67	2018
Discovery of gamma-ray pulsations from the transitional redback PSR J1227-4853	TJ Johnson, PS Ray, J Roy, CC Cheung, AK Harding, HJ Pletsch, S Fort, ...	The Astrophysical Journal 806 (1), 91, 2015	58	2015
The goldilocks zone: Towards better understanding of neural network loss landscapes	S Fort, A Scherlis	Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3574-3581, 2019	48	2019
Emergent properties of the local geometry of neural loss landscapes	S Fort, S Ganguli	arXiv preprint arXiv:1910.05929, 2019	45	2019
Analyzing monotonic linear interpolation in neural network loss landscapes	J Lucas, J Bae, MR Zhang, S Fort, R Zemel, R Grosse	arXiv preprint arXiv:2104.11044, 2021	38*	2021

Articles 1–20