Volgen
Hao Zhang
Hao Zhang
Geverifieerd e-mailadres voor ucsd.edu - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Judging llm-as-a-judge with mt-bench and chatbot arena
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
Advances in Neural Information Processing Systems 36, 46595-46623, 2023
2441*2023
Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality
WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ...
See https://vicuna. lmsys. org (accessed 14 April 2023) 2 (3), 6, 2023
2165*2023
Efficient memory management for large language model serving with pagedattention
W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ...
Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023
9482023
HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition
Z Yan, H Zhang, R Piramuthu, V Jagadeesh, D DeCoste, W Di, Y Yu
Proceedings of the IEEE international conference on computer vision, 2740-2748, 2015
664*2015
Poseidon: An efficient communication architecture for distributed deep learning on {GPU} clusters
H Zhang, Z Zheng, S Xu, W Dai, Q Ho, X Liang, Z Hu, J Wei, P Xie, ...
2017 USENIX Annual Technical Conference (USENIX ATC 17), 181-193, 2017
497*2017
Geeps: Scalable deep learning on distributed gpus with a gpu-specialized parameter server
H Cui, H Zhang, GR Ganger, PB Gibbons, EP Xing
Proceedings of the eleventh european conference on computer systems, 1-16, 2016
4092016
Automatic photo adjustment using deep neural networks
Z Yan, H Zhang, B Wang, S Paris, Y Yu
ACM Transactions on Graphics (TOG) 35 (2), 1-15, 2016
3222016
Alpa: Automating inter-and {Intra-Operator} parallelism for distributed deep learning
L Zheng, Z Li, H Zhang, Y Zhuang, Z Chen, Y Huang, Y Wang, Y Xu, ...
16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022
3122022
Scan: Structure correcting adversarial network for organ segmentation in chest x-rays
W Dai, N Dong, Z Wang, X Liang, H Zhang, EP Xing
International Workshop on Deep Learning in Medical Image Analysis, 263-273, 2018
2792018
Recurrent topic-transition gan for visual paragraph generation
X Liang, Z Hu, H Zhang, C Gan, EP Xing
Proceedings of the IEEE international conference on computer vision, 3362-3371, 2017
2542017
Chatbot arena: An open platform for evaluating llms by human preference
WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ...
arXiv preprint arXiv:2403.04132, 2024
2442024
Symbolic graph reasoning meets convolutions
X Liang, Z Hu, H Zhang, L Lin, EP Xing
Advances in neural information processing systems 31, 2018
1892018
Pollux: Co-adaptive cluster scheduling for goodput-optimized deep learning
A Qiao, SK Choe, SJ Subramanya, W Neiswanger, Q Ho, H Zhang, ...
15th {USENIX} Symposium on Operating Systems Design and Implementation …, 2021
1792021
A simple framework for open-vocabulary segmentation and detection
H Zhang, F Li, X Zou, S Liu, C Li, J Yang, L Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1292023
Generative semantic manipulation with mask-contrasting gan
X Liang, H Zhang, L Lin, E Xing
Proceedings of the European Conference on Computer Vision (ECCV), 558-573, 2018
120*2018
{AlpaServe}: Statistical multiplexing with model parallelism for deep learning serving
Z Li, L Zheng, Y Zhong, V Liu, Y Sheng, X Jin, Y Huang, Z Chen, H Zhang, ...
17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023
1152023
How Long Can Context Length of Open-Source LLMs truly Promise?
D Li, R Shao, A Xie, Y Sheng, L Zheng, J Gonzalez, I Stoica, X Ma, ...
NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023
112*2023
Terapipe: Token-level pipeline parallelism for training large-scale language models
Z Li, S Zhuang, S Guo, D Zhuo, H Zhang, D Song, I Stoica
International Conference on Machine Learning, 6543-6552, 2021
972021
Toward understanding the impact of staleness in distributed machine learning
W Dai, Y Zhou, N Dong, H Zhang, EP Xing
arXiv preprint arXiv:1810.03264, 2018
912018
Break the sequential dependency of llm inference using lookahead decoding
Y Fu, P Bailis, I Stoica, H Zhang
arXiv preprint arXiv:2402.02057, 2024
78*2024
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20