Segui
Xuhui Zhou
Xuhui Zhou
Email verificata su cs.cmu.edu - Home page
Titolo
Citata da
Citata da
Anno
Annotators with attitudes: How annotator beliefs and identities bias toxic language detection
M Sap, S Swayamdipta, L Vianna, X Zhou, Y Choi, NA Smith
Proceedings of the 2022 Conference of the North American Chapter of the …, 2021
2402021
Webarena: A realistic web environment for building autonomous agents
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, T Ou, Y Bisk, ...
arXiv preprint arXiv:2307.13854, 2023
2142023
Evaluating commonsense in pre-trained language models
X Zhou, Y Zhang, L Cui, D Huang
Proceedings of the AAAI conference on artificial intelligence 34 (05), 9733-9740, 2020
2102020
Challenges in automated debiasing for toxic language detection
X Zhou
Proceedings of the 16th Conference of the European Chapter of the …, 2021
1542021
Clever hans or neural theory of mind? stress testing social reasoning in large language models
N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ...
arXiv preprint arXiv:2305.14763, 2023
992023
Sotopia: Interactive evaluation for social intelligence in language agents
X Zhou, H Zhu, L Mathur, R Zhang, H Yu, Z Qi, LP Morency, Y Bisk, ...
arXiv preprint arXiv:2310.11667, 2023
772023
Can llms keep a secret? testing privacy implications of language models via contextual integrity theory
N Mireshghallah, H Kim, X Zhou, Y Tsvetkov, M Sap, R Shokri, Y Choi
arXiv preprint arXiv:2310.17884, 2023
492023
FANToM: A benchmark for stress-testing machine theory of mind in interactions
H Kim, M Sclar, X Zhou, RL Bras, G Kim, Y Choi, M Sap
arXiv preprint arXiv:2310.15421, 2023
482023
Linguistically-informed transformations (LIT): A method for automatically generating contrast sets
C Li, L Shengshuo, LZ Liu, X Wu, X Zhou, S Steinert-Threlkeld
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting …, 2020
342020
Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, and Graham Neubig
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng
Webarena: A realistic web environment for building autonomous agents 2 (3), 4, 2023
252023
Cobra frames: Contextual reasoning about effects and harms of offensive statements
X Zhou, H Zhu, A Yerukola, T Davidson, JD Hwang, S Swayamdipta, ...
Proceedings of the Association for Computational Linguistics (ACL), 2023
222023
Multilevel text alignment with cross-document attention
X Zhou, N Pappas, NA Smith
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
202020
Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, and Graham Neubig. 2023. WebArena: A Realistic Web Environment for Building Autonomous Agents
S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng
arXiv preprint arXiv:2307.13854, 0
20
Is this the real life? is this just fantasy? the misleading success of simulating social interactions with llms
X Zhou, Z Su, T Eisape, H Kim, M Sap
arXiv preprint arXiv:2403.05020, 2024
162024
Consent in crisis: the rapid decline of the AI data commons
S Longpre, R Mahari, AN Lee, CS Lund, H Oderinwale, W Brannon, ...
The Thirty-eight Conference on Neural Information Processing Systems …, 2024
142024
Extracting and inferring personal attributes from dialogue
Z Wang
Proceedings of the 4th Workshop on NLP for Conversational AI, 2021
142021
Emergent Communication Fine-tuning (EC-FT) for Pretrained Language Models
S Steinert-Threlkeld, X Zhou, Z Liu, CM Downey
Emergent Communication Workshop at ICLR 2022, 2022
122022
Clever hans or neural theory of mind
N Shapira, M Levy, SH Alavi, X Zhou, Y Choi, Y Goldberg, M Sap, ...
Stress testing social reasoning in large language models, 2023
112023
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
D Jain, P Kumar, S Gehman, X Zhou, T Hartvigsen, M Sap
arXiv preprint arXiv:2405.09373, 2024
92024
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting
A Yerukola, X Zhou, E Clark, M Sap
arXiv preprint arXiv:2305.14755, 2023
52023
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20