Yangsibo Huang
Verified email at google.com - Homepage
Title
Cited by
Year
Evaluating Gradient Inversion Attacks and Defenses in Federated Learning
Y Huang, S Gupta, Z Song, K Li, S Arora
NeurIPS 2021, 2021
Cited by 268 · 2021
Catastrophic Jailbreak of Open-Source LLMs via Exploiting Generation
Y Huang, S Gupta, M Xia, K Li, D Chen
ICLR 2024, 2024
Cited by 192 · 2024
Detecting pretraining data from large language models
W Shi, A Ajith, M Xia, Y Huang, D Liu, T Blevins, D Chen, L Zettlemoyer
ICLR 2024, 2024
Cited by 175 · 2024
Deep Q learning driven CT pancreas segmentation with geometry-aware U-Net
Y Man*, Y Huang*, J Feng, X Li, F Wu
IEEE Transactions on Medical Imaging, 2019
Cited by 168 · 2019
Instahide: Instance-hiding schemes for private distributed learning
Y Huang, Z Song, K Li, S Arora
ICML 2020, 2020
Cited by 163 · 2020
Recovering Private Text in Federated Learning of Language Models
S Gupta*, Y Huang*, Z Zhong, T Gao, K Li, D Chen
NeurIPS 2022, 2022
Cited by 77 · 2022
TextHide: Tackling Data Privacy in Language Understanding Tasks
Y Huang, Z Song, D Chen, K Li, S Arora
EMNLP 2020, 2020
Cited by 61 · 2020
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
B Wei*, K Huang*, Y Huang*, T Xie, X Qi, M Xia, P Mittal, M Wang, ...
ICML 2024, 2024
Cited by 53 · 2024
Advancing differential privacy: Where we are now and future directions for real-world deployment
R Cummings, D Desfontaines, D Evans, R Geambasu, Y Huang, ...
Harvard Data Science Review, 2024
Cited by 53* · 2024
DeepMC: a deep learning method for efficient Monte Carlo beamlet dose calculation by predictive denoising in magnetic resonance-guided radiotherapy
R Neph, Q Lyu, Y Huang, YM Yang, K Sheng
Physics in Medicine & Biology 66 (3), 035022, 2021
Cited by 43* · 2021
Privacy Implications of Retrieval-Based Language Models
Y Huang, S Gupta, Z Zhong, K Li, D Chen
EMNLP 2023, 2023
Cited by 28 · 2023
Privacy-Preserving Learning via Deep Net Pruning
Y Huang, Y Su, S Ravi, Z Song, S Arora, K Li
arXiv preprint arXiv:2003.01876, 2020
Cited by 27* · 2020
MUSE: Machine Unlearning Six-way Evaluation for Language Models
W Shi, J Lee, Y Huang, S Malladi, J Zhao, A Holtzman, D Liu, ...
arXiv preprint arXiv:2407.06460, 2024
Cited by 21 · 2024
A Safe Harbor for AI Evaluation and Red Teaming
S Longpre, S Kapoor, K Klyman, A Ramaswami, R Bommasani, ...
ICML 2024, 2024
Cited by 21 · 2024
A Dataset Auditing Method for Collaboratively Trained Machine Learning Models
Y Huang, CY Huang, X Li, K Li
IEEE Transactions on Medical Imaging, 2022
Cited by 15 · 2022
NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Y Huang, D Liu, Z Zhong, W Shi, YT Lee
arXiv preprint arXiv:2302.10879, 2023
Cited by 14 · 2023
SORRY-bench: Systematically evaluating large language model safety refusal behaviors
T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ...
arXiv preprint arXiv:2406.14598, 2024
Cited by 13 · 2024
: Auditing Data Removal from Trained Models
Y Huang, X Li, K Li
International Conference on Medical Image Computing and Computer-Assisted …, 2021
Cited by 11 · 2021
Evaluating Copyright Takedown Methods for Language Models
B Wei, W Shi, Y Huang, NA Smith, C Zhang, L Zettlemoyer, K Li, ...
arXiv preprint arXiv:2406.18664, 2024
Cited by 9 · 2024
AI Risk Management Should Incorporate Both Safety and Security
X Qi, Y Huang, Y Zeng, E Debenedetti, J Geiping, L He, K Huang, ...
arXiv preprint arXiv:2405.19524, 2024
Cited by 9 · 2024
Articles 1–20