‪Zidi Xiong‬ - ‪Google Scholar‬

Utwórz swój profil

Cytowane przez

	Wszystkie	Od 2020
Cytowania	613	609
h-indeks	9	9
i10-indeks	7	7

0

460

230

115

345

20232024202554 459 90

Dostęp publiczny

Wyświetl wszystko

3 artykuły

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Zidi Xiong

Zidi Xiong

Harvard University

Zweryfikowany adres z g.harvard.edu - Strona główna

Trustworthy machine learning


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models. B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ... NeurIPS, 2023	435	2023
Badchain: Backdoor chain-of-thought prompting for large language models Z Xiang, F Jiang, Z Xiong, B Ramasubramanian, R Poovendran, B Li arXiv preprint arXiv:2401.12242, 2024	64	2024
Rigorllm: Resilient guardrails for large language models against undesired content Z Yuan, Z Xiong, Y Zeng, N Yu, R Jia, D Song, B Li arXiv preprint arXiv:2403.13031, 2024	36	2024
Umd: Unsupervised model detection for x2x backdoor attacks Z Xiang, Z Xiong, B Li International Conference on Machine Learning, 38013-38038, 2023	17	2023
Decodingtrust: A comprehensive assessment of trustworthiness in gpt models, 2024 B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ... Cited on, 28, 2023	13	2023
Guardagent: Safeguard llm agents by a guard agent via knowledge-enabled reasoning Z Xiang, L Zheng, Y Li, J Hong, Q Li, H Xie, J Zhang, Z Xiong, C Xie, ... arXiv preprint arXiv:2406.09187, 2024	11	2024
CBD: A certified backdoor detector based on local dominant probability Z Xiang, Z Xiong, B Li Advances in Neural Information Processing Systems 36, 4937-4951, 2023	10	2023
DecodingTrust: A comprehensive assessment of trustworthiness in GPT models. arXiv B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ... arXiv preprint arXiv:2306.11698, 2024	9	2024
Label-smoothed backdoor attack M Peng, Z Xiong, M Sun, P Li arXiv e-prints, arXiv: 2202.11203, 2022	9	2022
Backdoor chain-of-thought prompting for large language models Z Xiang, F Jiang, Z Xiong, B Ramasubramanian, R Poovendran, BB Li Proceedings of the NeurIPS 2023 Workshop on Backdoors in Deep Learning—The …, 2023	8	2023
Rethinking the necessity of labels in backdoor removal Z Xiong, D Wu, Y Wang, Y Wang ICLR 2023 Workshop on Backdoor Attacks and Defenses in Machine Learning, 2023	1	2023
GuardAgent: Safeguard LLM Agent by a Guard Agent via Knowledge-Enabled Reasoning Z Xiang, L Zheng, Y Li, J Hong, Q Li, H Xie, J Zhang, Z Xiong, C Xie, ...
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models C Xu, J Zhang, Z Chen, C Xie, M Kang, Z Yuan, Z Xiong, C Zhang, L Yuan, ... The Thirteenth International Conference on Learning Representations, 0

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–13