Symbolic knowledge distillation: from general language models to commonsense models P West, C Bhagavatula, J Hessel, JD Hwang, L Jiang, RL Bras, X Lu, ... NAACL 2022, 2021 | 285 | 2021 |
Faith and fate: Limits of transformers on compositionality N Dziri, X Lu, M Sclar, XL Li, L Jiang, BY Lin, S Welleck, P West, ... NeurIPS 2023, 2024 | 270 | 2024 |
Quizbot: A dialogue-based adaptive learning system for factual knowledge S Ruan, L Jiang, J Xu, BJK Tham, Z Qiu, Y Zhu, EL Murnane, E Brunskill, ... CHI 2019, 2019 | 219 | 2019 |
Can machines learn morality? the delphi experiment L Jiang, JD Hwang, C Bhagavatula, RL Bras, J Liang, J Dodge, ... Accepted in Principle to Nature Machine Intelligence, 2021 | 217* | 2021 |
Quark: Controllable text generation with reinforced unlearning X Lu, S Welleck, J Hessel, L Jiang, L Qin, P West, P Ammanabrolu, Y Choi NeurIPS 2022, 2022 | 165 | 2022 |
Neurologic a* esque decoding: Constrained text generation with lookahead heuristics X Lu, S Welleck, P West, L Jiang, J Kasai, D Khashabi, RL Bras, L Qin, ... NAACL 2022, 2021 | 151 | 2021 |
Soda: Million-scale dialogue distillation with social commonsense contextualization H Kim, J Hessel, L Jiang, P West, X Lu, Y Yu, P Zhou, RL Bras, M Alikhani, ... EMNLP 2023, 2022 | 117 | 2022 |
Bookbuddy: Turning digital materials into interactive foreign language lessons through a voice chatbot S Ruan, A Willis, Q Xu, GM Davis, L Jiang, E Brunskill, JA Landay Proceedings of the sixth (2019) ACM conference on learning@ scale, 1-4, 2019 | 108 | 2019 |
Prosocialdialog: A prosocial backbone for conversational agents H Kim, Y Yu, L Jiang, X Lu, D Khashabi, G Kim, Y Choi, M Sap EMNLP 2022, 2022 | 98 | 2022 |
Englishbot: An ai-powered conversational system for second language learning S Ruan, L Jiang, Q Xu, Z Liu, GM Davis, E Brunskill, JA Landay IUI 2021, 2021 | 72 | 2021 |
Value kaleidoscope: Engaging ai with pluralistic human values, rights, and duties T Sorensen, L Jiang, JD Hwang, S Levine, V Pyatkin, P West, N Dziri, ... AAAI 2024, 2024 | 65* | 2024 |
A roadmap to pluralistic alignment T Sorensen, J Moore, J Fisher, M Gordon, N Mireshghallah, CM Rytting, ... ICML 2024, 2024 | 60* | 2024 |
ClarifyDelphi: Reinforced clarification questions with defeasibility rewards for social and moral situations V Pyatkin, JD Hwang, V Srikumar, X Lu, L Jiang, Y Choi, C Bhagavatula ACL 2023, 2022 | 37* | 2022 |
Aligning to social norms and values in interactive narratives P Ammanabrolu, L Jiang, M Sap, H Hajishirzi, Y Choi NAACL 2022, 2022 | 36 | 2022 |
"I'm Not Mad": Commonsense Implications of Negation and Contradiction L Jiang, A Bosselut, C Bhagavatula, Y Choi NAACL 2021, 2021 | 35 | 2021 |
Phenomenal yet puzzling: Testing inductive reasoning capabilities of language models with hypothesis refinement L Qiu, L Jiang, X Lu, M Sclar, V Pyatkin, C Bhagavatula, B Wang, Y Kim, ... ICLR 2024, 2023 | 31 | 2023 |
Wildguard: Open one-stop moderation tools for safety risks, jailbreaks, and refusals of llms S Han, K Rao, A Ettinger, L Jiang, BY Lin, N Lambert, Y Choi, N Dziri NeurIPS D&B 2024, 2024 | 19 | 2024 |
THE GENERATIVE AI PARADOX:“What It Can Create, It May Not Understand” P West, X Lu, N Dziri, F Brahman, L Li, JD Hwang, L Jiang, J Fisher, ... ICLR 2024, 2023 | 12 | 2023 |
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models L Jiang, K Rao, S Han, A Ettinger, F Brahman, S Kumar, N Mireshghallah, ... NeurIPS 2024, 2024 | 10* | 2024 |
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs'(Lack of) Multicultural Knowledge YY Chiu, L Jiang, M Antoniak, CY Park, SS Li, M Bhatia, S Ravi, ... arXiv preprint arXiv:2404.06664, 2024 | 9 | 2024 |