Minjia Zhang
Title
Cited by
Year
BLOOM: A 176B-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
Cited by 1604 · 2023
ZeRO-Offload: Democratizing billion-scale model training
J Ren, S Rajbhandari, RY Aminabadi, O Ruwase, S Yang, M Zhang, D Li, ...
2021 USENIX Annual Technical Conference (USENIX ATC 21), 551-564, 2021
Cited by 361 · 2021
ZeroQuant: Efficient and affordable post-training quantization for large-scale transformers
Z Yao, R Yazdani Aminabadi, M Zhang, X Wu, C Li, Y He
Advances in Neural Information Processing Systems 35, 27168-27183, 2022
Cited by 326 · 2022
Memcached design on high performance RDMA capable interconnects
J Jose, H Subramoni, M Luo, M Zhang, J Huang, M Wasi-ur-Rahman, ...
2011 International Conference on Parallel Processing, 743-752, 2011
Cited by 267 · 2011
DeepSpeed-Inference: Enabling efficient inference of transformer models at unprecedented scale
RY Aminabadi, S Rajbhandari, AA Awan, C Li, D Li, E Zheng, O Ruwase, ...
SC22: International Conference for High Performance Computing, Networking …, 2022
Cited by 261 · 2022
DeepSpeed-MoE: Advancing mixture-of-experts inference and training to power next-generation AI scale
S Rajbhandari, C Li, Z Yao, M Zhang, RY Aminabadi, AA Awan, J Rasley, ...
International conference on machine learning, 18332-18346, 2022
Cited by 223 · 2022
OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization
G Ahdritz, N Bouatta, C Floristean, S Kadyan, Q Xia, W Gerecke, ...
Nature Methods, 1-11, 2024
Cited by 181 · 2024
Learning intrinsic sparse structures within long short-term memory
W Wen, Y He, S Rajbhandari, M Zhang, W Wang, F Liu, B Hu, Y Chen, ...
arXiv preprint arXiv:1709.05027, 2017
Cited by 156 · 2017
DeepCPU: Serving RNN-based Deep Learning Models 10x Faster
M Zhang, S Rajbhandari, W Wang, Y He
2018 USENIX Annual Technical Conference (USENIX ATC 18), 951-965, 2018
Cited by 124 · 2018
Accelerating training of transformer-based language models with progressive layer dropping
M Zhang, Y He
Advances in neural information processing systems 33, 14011-14023, 2020
Cited by 107 · 2020
Model tells you what to discard: Adaptive KV cache compression for LLMs
S Ge, Y Zhang, L Liu, M Zhang, J Han, J Gao
arXiv preprint arXiv:2310.01801, 2023
Cited by 101 · 2023
Valor: Efficient, software-only region conflict exceptions
S Biswas, M Zhang, MD Bond, B Lucia
ACM SIGPLAN Notices 50 (10), 241-259, 2015
Cited by 74 · 2015
Sentinel: Efficient tensor migration and allocation on heterogeneous memory systems for deep learning
J Ren, J Luo, K Wu, M Zhang, H Jeon, D Li
2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021
Cited by 64 · 2021
HM-ANN: Efficient billion-point nearest neighbor search on heterogeneous memory
J Ren, M Zhang, D Li
Advances in Neural Information Processing Systems 33, 10672-10684, 2020
Cited by 59 · 2020
Octet: Capturing and controlling cross-thread dependences efficiently
MD Bond, M Kulkarni, M Cao, M Zhang, M Fathi Salmi, S Biswas, ...
ACM SIGPLAN Notices 48 (10), 693-712, 2013
Cited by 58 · 2013
Improving approximate nearest neighbor search through learned adaptive early termination
C Li, M Zhang, DG Andersen, Y He
Proceedings of the 2020 ACM SIGMOD International Conference on Management of …, 2020
Cited by 56 · 2020
Bamboo: Making preemptible instances resilient for affordable training of large DNNs
J Thorpe, P Zhao, J Eyolfson, Y Qiao, Z Jia, M Zhang, R Netravali, GH Xu
20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023
Cited by 54 · 2023
DeepSpeed-Chat: Easy, fast and affordable RLHF training of ChatGPT-like models at all scales
Z Yao, RY Aminabadi, O Ruwase, S Rajbhandari, X Wu, AA Awan, ...
arXiv preprint arXiv:2308.01320, 2023
Cited by 53 · 2023
Navigating with graph representations for fast and scalable decoding of neural language models
M Zhang, W Wang, X Liu, J Gao, Y He
Advances in neural information processing systems 31, 2018
Cited by 50 · 2018
Hybrid static–dynamic analysis for statically bounded region serializability
A Sengupta, S Biswas, M Zhang, MD Bond, M Kulkarni
ACM SIGPLAN Notices 50 (4), 561-575, 2015
Cited by 45 · 2015
Articles 1–20