Obserwuj
Ryan A. Chi
Ryan A. Chi
Zweryfikowany adres z cs.stanford.edu - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
13932022
Holistic evaluation of language models
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
arXiv preprint arXiv:2211.09110, 2022
12462022
Premise Order Matters in Reasoning with Large Language Models
X Chen, RA Chi, X Wang, D Zhou
https://arxiv.org/abs/2402.08939, 2024
572024
Holistic evaluation of language models, 2022
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
URL https://arxiv. org/abs/2211.09110 3, 2023
402023
ModeLing: A novel dataset for testing linguistic reasoning in language models
NA Chi, T Malchev, R Kong, RA Chi, L Huang, EA Chi, RT McCoy, ...
arXiv preprint arXiv:2406.17038, 2024
82024
Lingoly: A benchmark of olympiad-level linguistic reasoning puzzles in low-resource and extinct languages
AM Bean, S Hellsten, H Mayne, J Magomere, EA Chi, R Chi, SA Hale, ...
arXiv preprint arXiv:2406.06196, 2024
72024
Dialogue distillery: Crafting interpolable, interpretable, and introspectable dialogue from llms
RA Chi, J Kim, S Hickmann, S Li, G Chi, T Atchariyachanvanit, K Yu, ...
Alexa Prize SocialBot Grand Challenge 5, 2023
62023
Stanford MLab at SemEval 2023 task 7: Neural methods for clinical trial report NLI
C Takehana, D Lim, E Kurtuluş, R Iyer, E Tanimura, P Aggarwal, ...
Proceedings of the 17th International Workshop on Semantic Evaluation …, 2023
22023
Redwoodnlp at semeval-2021 task 7: Ensembled pretrained and lightweight models for humor detection
N Chi, R Chi
Proceedings of the 15th international workshop on semantic evaluation …, 2021
22021
GLARE: Generative Left-to-right AdversaRial Examples
RA Chi, N Kim, P Liu, Z Lack, EA Chi
Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems …, 2022
2022
Stanford MLab at SemEval 2022 Task 7: Tree-and Transformer-Based Methods for Clarification Plausibility
T Yim, J Lee, R Verma, S Hickmann, A Zhu, C Sallade, I Ng, R Chi, P Liu
Proceedings of the 16th International Workshop on Semantic Evaluation …, 2022
2022
Automated Topic-Tagging for Software-Related Question-and-Answer Sites
A Agrawal, RA Chi, V Gupta
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–12