Defending ChatGPT against jailbreak attack via self-reminders Y Xie*, J Yi*, J Shao, J Curl, L Lyu, Q Chen, X Xie, F Wu Nature Machine Intelligence, 1-11, 2023 | 138* | 2023 |
Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation J Yi, F Wu, C Wu, R Liu, G Sun, X Xie Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 64 | 2021 |
Benchmarking and defending against indirect prompt injection attacks on large language models J Yi*, Y Xie*, B Zhu, E Kiciman, G Sun, X Xie, F Wu The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023 | 47 | 2023 |
Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark W Peng*, J Yi*, F Wu, S Wu, B Zhu, L Lyu, B Jiao, T Xu, G Sun, X Xie Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 46* | 2023 |
Tiny-NewsRec: Effective and Efficient PLM-based News Recommendation Y Yu, F Wu, C Wu, J Yi, Q Liu Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2021 | 35* | 2021 |
UA-FedRec: Untargeted Attack on Federated News Recommendation J Yi, F Wu, B Zhu, J Yao, Z Tao, G Sun, X Xie The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022 | 16 | 2022 |
DebiasedRec: Bias-aware User Modeling and Click Prediction for Personalized News Recommendation J Yi, F Wu, C Wu, Q Li, G Sun, X Xie arXiv preprint arXiv:2104.07360, 2021 | 11 | 2021 |
On the Vulnerability of Safety Alignment in Open-Access LLMs J Yi*, R Ye*, Q Chen, BB Zhu, S Chen, D Lian, G Sun, X Xie, F Wu | 10* | 2023 |
Non-IID always Bad? Semi-Supervised Heterogeneous Federated Learning with Local Knowledge Enhancement C Zhang, F Wu, J Yi, D Xu, Y Yu, J Wang, Y Wang, T Xu, X Xie, E Chen Proceedings of the 32nd ACM International Conference on Information and …, 2023 | 7 | 2023 |
Control Risk for Potential Misuse of Artificial Intelligence in Science J He*, W Feng*, Y Min*, J Yi*, K Tang, S Li, J Zhang, K Chen, W Zhou, ... arXiv preprint arXiv:2312.06632, 2023 | 5 | 2023 |
Effective and Efficient Query-aware Snippet Extraction for Web Search J Yi, F Wu, C Wu, X Huang, B Jiao, G Sun, X Xie Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 3 | 2022 |
Robust Quantity-Aware Aggregation for Federated Learning J Yi, F Wu, H Zhang, B Zhu, G Sun, X Xie arXiv preprint arXiv:2205.10848, 2022 | 2 | 2022 |
Elephant in the Room: Unveiling the Impact of Reward Model Quality in Alignment Y Liu, X Yi, X Chen, J Yao, J Yi, D Zan, Z Liu, X Xie, TY Ho arXiv preprint arXiv:2409.19024, 2024 | | 2024 |
Measuring Human Contribution in AI-Assisted Content Generation Y Xie, T Qi, J Yi, R Whalen, J Huang, Q Ding, Y Xie, X Xie, F Wu arXiv preprint arXiv:2408.14792, 2024 | | 2024 |