Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2084 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 617 | 2024 |
Towards understanding knowledge distillation M Phuong, C Lampert International conference on machine learning, 5142-5151, 2019 | 331 | 2019 |
Distillation-based training for multi-exit architectures M Phuong, CH Lampert Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 209 | 2019 |
Model evaluation for extreme risks T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... arXiv preprint arXiv:2305.15324, 2023 | 132 | 2023 |
Formal algorithms for transformers M Phuong, M Hutter arXiv preprint arXiv:2207.09238, 2022 | 97 | 2022 |
Goal misgeneralization: Why correct specifications aren't enough for correct goals R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton arXiv preprint arXiv:2210.01790, 2022 | 57 | 2022 |
The inductive bias of ReLU networks on orthogonally separable data P Bui Thi Mai, C Lampert 9th International Conference on Learning Representations, 2021 | 44 | 2021 |
Functional vs. parametric equivalence of ReLU networks P Bui Thi Mai, C Lampert 8th International Conference on Learning Representations, 2020 | 39 | 2020 |
Evaluating frontier models for dangerous capabilities M Phuong, M Aitchison, E Catt, S Cogan, A Kaskasoli, V Krakovna, ... arXiv preprint arXiv:2403.13793, 2024 | 35 | 2024 |
The mutual autoencoder: Controlling information in latent code representations M Phuong, M Welling, N Kushman, R Tomioka, S Nowozin | 24 | 2018 |
Model evaluation for extreme risks (arXiv: 2305.15324). arXiv T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... | 10 | 2023 |
The mutual autoencoder: Controlling information in latent code representations, 2018 M Phuong, M Welling, N Kushman, R Tomioka, S Nowozin URL https://openreview.net/forum, 0 | 8 | |
Against the flow of time with multi-output models J Jakubík, P Bui Thi Mai, M Chvosteková, A Krakovská Measurement Science Review 23 (4), 2023 | | 2023 |
Underspecification in deep learning P Bui Thi Mai | | 2021 |