The power of scale for parameter-efficient prompt tuning B Lester, R Al-Rfou, N Constant arXiv preprint arXiv:2104.08691, 2021 | 3529 | 2021 |
Finetuned language models are zero-shot learners J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le arXiv preprint arXiv:2109.01652, 2021 | 3107 | 2021 |
Spot: Better frozen model adaptation through soft prompt transfer T Vu, B Lester, N Constant, R Al-Rfou, D Cer arXiv preprint arXiv:2110.07904, 2021 | 264 | 2021 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 151 | 2023 |
Overcoming catastrophic forgetting in zero-shot cross-lingual generation T Vu, A Barua, B Lester, D Cer, M Iyyer, N Constant arXiv preprint arXiv:2205.12647, 2022 | 57 | 2022 |
An Effective Label Noise Model for DNN Text Classification I Jindal, D Pressel, B Lester, M Nokleby Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 | 52 | 2019 |
Baseline: A library for rapid modeling, experimentation and development of deep learning algorithms targeting nlp D Pressel, SR Choudhury, B Lester, Y Zhao, M Barta Proceedings of Workshop for NLP Open Source Software (NLP-OSS), 34-40, 2018 | 15* | 2018 |
Parameter Efficient Prompt Tuning for Efficient Models at Scale BD Lester, R Al-Rfou, N Constant US Patent App. 17/718,738, 2023 | 10 | 2023 |
Reducing retraining by recycling parameter-efficient prompts B Lester, J Yurtsever, S Shakeri, N Constant arXiv preprint arXiv:2208.05577, 2022 | 10 | 2022 |
Git-theta: A git extension for collaborative development of machine learning models N Kandpal, B Lester, M Muqeeth, A Mascarenhas, M Evans, V Baskaran, ... International Conference on Machine Learning, 15708-15719, 2023 | 9 | 2023 |
Multiple Word Embeddings for Increased Diversity of Representation B Lester, D Pressel, A Hemmeter, SR Choudhury, S Bangalore arXiv preprint arXiv:2009.14394, 2020 | 9* | 2020 |
iobes: A Library for Span-Level Processing B Lester Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS), 115-119, 2020 | 6 | 2020 |
Constrained Decoding for Computationally Efficient Named Entity Recognition Taggers B Lester, D Pressel, A Hemmeter, SR Choudhury, S Bangalore Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 5 | 2020 |
Training LLMs over Neurally Compressed Text B Lester, J Lee, A Alemi, J Pennington, A Roberts, J Sohl-Dickstein, ... arXiv preprint arXiv:2404.03626, 2024 | 2 | 2024 |
Frozen Model Adaptation Through Soft Prompt Transfer TT Vu, DM Cer, N Constant, BD Lester, R Al-Rfou US Patent App. 17/863,840, 2024 | 1 | 2024 |
Dynamically Adjusting a Voice Recognition System B Lester, SM Panainte US Patent 9,984,688, 2018 | 1 | 2018 |
Prompt Tuning Using One or More Machine-Learned Models BD Lester, RES Al-rfou, NJ Constant US Patent App. 18/684,518, 2024 | | 2024 |
Realistic Evaluation of Model Merging for Compositional Generalization D Tam, Y Kant, B Lester, I Gilitschenski, C Raffel arXiv preprint arXiv:2409.18314, 2024 | | 2024 |
Intent Features for Rich Natural Language Understanding B Lester, SR Choudhury, R Prasad, S Bangalore Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | | 2021 |
Leader: Prefixing a Length for Faster Word Vector Serialization B Lester arXiv preprint arXiv:2009.13699, 2020 | | 2020 |