Prioritized training on points that are learnable, worth learning, and not yet learnt. S Mindermann, JM Brauner, MT Razzak, M Sharma, A Kirsch, W Xu, ... International Conference on Machine Learning, 15630-15649, 2022 | 160 | 2022 |
Mitigating harm in language models with conditional-likelihood filtration. H Ngo, C Raterink, JGM Araújo, I Zhang, C Chen, A Morisot, N Frosst. arXiv preprint arXiv:2108.07790, 2021 | 34 | 2021 |
Aya Expanse: Combining research breakthroughs for a new multilingual frontier. J Dang, S Singh, D D'souza, A Ahmadian, A Salamanca, M Smith, ... arXiv preprint arXiv:2412.04261, 2024 | 4 | 2024 |
To code, or not to code? Exploring impact of code in pre-training. V Aryabumi, Y Su, R Ma, A Morisot, I Zhang, A Locatelli, M Fadaee, ... arXiv preprint arXiv:2408.10914, 2024 | 3 | 2024 |
Add a SideNet to your MainNet. A Morisot. arXiv preprint arXiv:2007.13512, 2020 | | 2020 |