Exploring the limits of transfer learning with a unified text-to-text transformer C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ... Journal of Machine Learning Research, 2020 | 19775 | 2020 |
Fixmatch: Simplifying semi-supervised learning with consistency and confidence K Sohn, D Berthelot, CL Li, Z Zhang, N Carlini, ED Cubuk, A Kurakin, ... Neural Information Processing Systems, 2020 | 3877 | 2020 |
Mixmatch: A holistic approach to semi-supervised learning D Berthelot, N Carlini, I Goodfellow, N Papernot, A Oliver, C Raffel Neural Information Processing Systems, 2019 | 3719 | 2019 |
librosa: Audio and music signal analysis in python B McFee, C Raffel, D Liang, DPW Ellis, M McVicar, E Battenberg, O Nieto Python in Science Conference, 2015 | 3320 | 2015 |
Emergent abilities of large language models J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... Transactions on Machine Learning Research, 2022 | 3014 | 2022 |
mT5: A massively multilingual pre-trained text-to-text transformer L Xue, N Constant, A Roberts, M Kale, R Al-Rfou, A Siddhant, A Barua, ... Annual Conference of the North American Chapter of the Association for …, 2020 | 2349 | 2020 |
Extracting training data from large language models N Carlini, F Tramer, E Wallace, M Jagielski, A Herbert-Voss, K Lee, ... USENIX Security Symposium, 2021 | 1791 | 2021 |
Multitask Prompted Training Enables Zero-Shot Task Generalization V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... International Conference on Learning Representations, 2021 | 1684 | 2021 |
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1638 | 2023 |
Remixmatch: Semi-supervised learning with distribution alignment and augmentation anchoring D Berthelot, N Carlini, ED Cubuk, A Kurakin, K Sohn, H Zhang, C Raffel International Conference on Learning Representations, 2019 | 1289 | 2019 |
Realistic evaluation of deep semi-supervised learning algorithms A Oliver, A Odena, C Raffel, ED Cubuk, I Goodfellow Neural Information Processing Systems, 2018 | 1261 | 2018 |
Theano: A Python framework for fast computation of mathematical expressions R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ... arXiv preprint arXiv:1605.02688, 2016 | 1237 | 2016 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... Transactions on Machine Learning Research, 2023 | 1140 | 2023 |
Probabilistic machine learning: an introduction KP Murphy MIT press, 2022 | 1110 | 2022 |
How Much Knowledge Can You Pack Into the Parameters of a Language Model? A Roberts, C Raffel, N Shazeer Conference on Empirical Methods in Natural Language Processing, 2020 | 863 | 2020 |
Thermometer Encoding: One Hot Way To Resist Adversarial Examples J Buckman, A Roy, C Raffel, I Goodfellow International Conference on Learning Representations, 2018 | 777 | 2018 |
Few-shot parameter-efficient fine-tuning is better and cheaper than in-context learning H Liu, D Tam, M Muqeeth, J Mohta, T Huang, M Bansal, C Raffel Neural Information Processing Systems, 2022 | 735 | 2022 |
A hierarchical latent vector model for learning long-term structure in music A Roberts, J Engel, C Raffel, C Hawthorne, D Eck International Conference on Machine Learning, 2018 | 634 | 2018 |
mir_eval: A Transparent Implementation of Common MIR Metrics C Raffel, B McFee, EJ Humphrey, J Salamon, O Nieto, D Liang, DPW Ellis International Society for Music Information Retrieval Conference, 2014 | 627 | 2014 |
Crosslingual generalization through multitask finetuning N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ... Annual Meeting of the Association for Computational Linguistics, 2023 | 624 | 2023 |