Jidong Zhai
Title
Cited by
Year
GLM-130B: An open bilingual pre-trained model
A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding, Z Yang, Y Xu, W Zheng, X Xia, ...
arXiv preprint arXiv:2210.02414, 2022
Cited by: 518 · Year: 2022
Phantom: predicting performance of parallel applications on large-scale parallel machines using a single node
J Zhai, W Chen, W Zheng
ACM SIGPLAN Notices 45 (5), 305-314, 2010
Cited by: 205* · Year: 2010
Cloud versus in-house cluster: evaluating Amazon cluster compute instances for running MPI applications
Y Zhai, M Liu, J Zhai, X Ma, W Chen
State of the practice reports, 1-10, 2011
Cited by: 161 · Year: 2011
Understanding co-running behaviors on integrated CPU/GPU architectures
F Zhang, J Zhai, B He, S Zhang, W Chen
IEEE Transactions on Parallel and Distributed Systems 28 (3), 905-918, 2016
Cited by: 118 · Year: 2016
POCLib: A high-performance framework for enabling near orthogonal processing on compression
F Zhang, J Zhai, X Shen, O Mutlu, X Du
IEEE Transactions on Parallel and Distributed Systems 33 (2), 459-475, 2021
Cited by: 92 · Year: 2021
FastMoE: A fast mixture-of-expert training system
J He, J Qiu, A Zeng, Z Yang, J Zhai, J Tang
arXiv preprint arXiv:2103.13262, 2021
Cited by: 83 · Year: 2021
Understanding and bridging the gaps in current GNN performance optimizations
K Huang, J Zhai, Z Zheng, Y Yi, X Shen
Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of …, 2021
Cited by: 83 · Year: 2021
GraphPi: High performance graph pattern matching through effective redundancy elimination
T Shi, M Zhai, Y Xu, J Zhai
SC20: International Conference for High Performance Computing, Networking …, 2020
Cited by: 74 · Year: 2020
Zwift: A programming framework for high performance text analytics on compressed data
F Zhang, J Zhai, X Shen, O Mutlu, W Chen
Proceedings of the 2018 International Conference on Supercomputing, 195-206, 2018
Cited by: 71 · Year: 2018
PET: Optimizing tensor programs with partially equivalent transformations and automated corrections
H Wang, J Zhai, M Gao, Z Ma, S Tang, L Zheng, Y Li, K Rong, Y Chen, ...
15th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2021
Cited by: 68 · Year: 2021
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures
Z Zheng, X Yang, P Zhao, G Long, K Zhu, F Zhu, W Zhao, X Liu, J Yang, ...
Proceedings of the 27th ACM International Conference on Architectural …, 2022
Cited by: 61 · Year: 2022
Process mapping for MPI collective communications
J Zhang, J Zhai, W Chen, W Zheng
Euro-Par 2009 Parallel Processing: 15th International Euro-Par Conference …, 2009
Cited by: 60 · Year: 2009
Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters
S Niu, J Zhai, X Ma, X Tang, W Chen
Proceedings of the International Conference on High Performance Computing …, 2013
Cited by: 59 · Year: 2013
FinePar: Irregularity-aware fine-grained workload partitioning on integrated architectures
F Zhang, B Wu, J Zhai, B He, W Chen
2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017
Cited by: 58 · Year: 2017
FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models
J He, J Zhai, T Antunes, H Wang, F Luo, S Shi, Q Li
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
Cited by: 57 · Year: 2022
TADOC: Text analytics directly on compression
F Zhang, J Zhai, X Shen, D Wang, Z Chen, O Mutlu, W Chen, X Du
The VLDB Journal 30, 163-188, 2021
Cited by: 56 · Year: 2021
Norms of the screen for child anxiety related emotional disorders in Chinese urban children
K Wang
Chinese Journal of Clinical Psychology 10 (4), 270-272, 2002
Cited by: 56 · Year: 2002
BaGuaLu: targeting brain scale pretrained models with over 37 million cores
Z Ma, J He, J Qiu, H Cao, Y Wang, Z Sun, L Zheng, H Wang, S Tang, ...
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
Cited by: 52 · Year: 2022
BitFlow: Exploiting vector parallelism for binary neural networks on CPU
Y Hu, J Zhai, D Li, Y Gong, Y Zhu, W Liu, L Su, J Jin
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018
Cited by: 50 · Year: 2018
Scalable graph traversal on Sunway TaihuLight with ten million cores
H Lin, X Tang, B Yu, Y Zhuo, W Chen, J Zhai, W Yin, W Zheng
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
Cited by: 46 · Year: 2017