Follow
Peng Chen
Title
Cited by
Cited by
Year
Matrix engines for high performance computing: A paragon of performance or grasping at straws?
J Domke, E Vatai, A Drozd, P Chen, Y Oyama, L Zhang, S Salaria, ...
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
382021
A versatile software systolic execution model for GPU memory-bound kernels
Peng Chen , Wahib Mohamed, Takizawa Shinichiro,Takano Ryousei, Matsuoka Satoshi
Proceedings of the International Conference for High Performance Computing …, 2019
292019
Boosting the predictive performance with aqueous solubility dataset curation
J Meng, P Chen, M Wahib, M Yang, L Zheng, Y Wei, S Feng, W Liu
Scientific Data 9 (1), 71, 2022
212022
Automatic generation of high-performance convolution kernels on ARM CPUs for deep learning
J Meng, C Zhuang, P Chen, M Wahib, B Schmidt, X Wang, H Lan, D Wu, ...
IEEE Transactions on Parallel and Distributed Systems 33 (11), 2885-2899, 2022
122022
iFDK: a scalable framework for instant high-resolution image reconstruction
Peng Chen , Wahib Mohamed, Takizawa Shinichiro,Takano Ryousei, Matsuoka Satoshi
Proceedings of the International Conference for High Performance Computing …, 2019
12*2019
Evolutionary architecture search for generative adversarial networks based on weight sharing
Y Xue, W Tong, F Neri, P Chen, T Luo, L Zhen, X Wang
IEEE Transactions on Evolutionary Computation, 2023
112023
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads
J Domke, E Vatai, B Gerofi, Y Kodama, M Wahib, A Podobas, S Mittal, ...
arXiv preprint arXiv:2204.02235, 2022
92022
Scalable FBP decomposition for cone-beam CT reconstruction
P Chen, M Wahib, X Wang, T Hirofuchi, H Ogawa, A Biguri, R Boardman, ...
Proceedings of the International Conference for High Performance Computing …, 2021
92021
Efficient Algorithms for the Summed Area Tables Primitive on GPUs
Peng Chen , Wahib Mohamed, Takizawa Shinichiro,Takano Ryousei, Matsuoka Satoshi
IEEE International Conference on Cluster Computing (CLUSTER), 2018
92018
Physics-Based Iterative Reconstruction for Dual Source and Flying Focal Spot Computed Tomography
X Wang, RD MacDougall, P Chen, CA Bouman, SK Warfield
arXiv e-prints, arXiv: 2001.09471, 2021
6*2021
PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications
L Zhang, M Wahib, P Chen, J Meng, X Wang, T Endo, S Matsuoka
Proceedings of the 37th International Conference on Supercomputing, 167-179, 2023
52023
Simeuro: A hybrid CPU-GPU parallel simulator for neuromorphic computing chips
H Zhang, NM Ho, DY Polat, P Chen, M Wahib, TT Nguyen, J Meng, ...
IEEE Transactions on Parallel and Distributed Systems 34 (10), 2767-2782, 2023
42023
Persistent Kernels for Iterative Memory-bound GPU Applications
L Zhang, M Wahib, P Chen, J Meng, X Wang, S Matsuoka
arXiv preprint arXiv:2204.02064, 2022
42022
Performance portable back-projection algorithms on CPUs: agnostic data locality and vectorization optimizations
P Chen, M Wahib, X Wang, S Takizawa, T Hirofuchi, H Ogawa, ...
Proceedings of the ACM International Conference on Supercomputing, 316-328, 2021
42021
Revisiting Temporal Blocking Stencil Optimizations
L Zhang, M Wahib, P Chen, J Meng, X Wang, T Endo, S Matsuoka
Proceedings of the 37th International Conference on Supercomputing, 251-263, 2023
32023
Image gradient decomposition for parallel and memory-efficient ptychographic reconstruction
X Wang, A Tsaris, D Mukherjee, M Wahib, P Chen, M Oxley, ...
Proceedings of the International Conference for High Performance Computing …, 2022
32022
Real-time High-resolution X-Ray Computed Tomography
D Wu, P Chen, X Wang, I Lyngaas, T Miyajima, T Endo, S Matsuoka, ...
22024
Ultra-Long Sequence Distributed Transformer
X Wang, I Lyngaas, A Tsaris, P Chen, S Dash, MC Shekar, T Luo, ...
arXiv preprint arXiv:2311.02382, 2023
22023
Pushing the Limits for 2D Convolution Computation On CUDA-enabled GPUs
P Chen, M Wahib, S Takizawa, S Matsuoka
Technical Report, HPC-163, 2018
12018
Asynchronous I/O Optimization for X-Ray Imaging via GPUDirect Storage
D Wu, P Chen, Y Tan, Y Tanimura, T Endo, S Matsuoka, M Wahib
2024 IEEE International Conference on Cluster Computing Workshops (CLUSTER …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20