オープン アクセスを義務付けられた論文 - Azzam Haidar詳細
一般公開: 63 件
Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers
A Haidar, S Tomov, J Dongarra, NJ Higham
SC18: International Conference for High Performance Computing, Networking …, 2018
委任: US National Science Foundation, US Department of Energy, UK Engineering and …
Performance, design, and autotuning of batched GEMM for GPUs
A Abdelfattah, A Haidar, S Tomov, J Dongarra
High Performance Computing: 31st International Conference, ISC High …, 2016
委任: US National Science Foundation, US Department of Energy
The singular value decomposition: Anatomy of optimizing an algorithm for extreme scale
J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, S Tomov, ...
SIAM review 60 (4), 808-865, 2018
委任: US National Science Foundation, US Department of Energy
High-performance tensor contractions for GPUs
A Abdelfattah, M Baboulin, V Dobrev, J Dongarra, C Earl, J Falcou, ...
Procedia Computer Science 80, 108-118, 2016
委任: US National Science Foundation, US Department of Energy
Investigating half precision arithmetic to accelerate dense linear system solvers
A Haidar, P Wu, S Tomov, J Dongarra
Proceedings of the 8th workshop on latest advances in scalable algorithms …, 2017
委任: US National Science Foundation, US Department of Energy
RETRACTED: Batched matrix computations on hardware accelerators based on GPUs
A Haidar, T Dong, P Luszczek, S Tomov, J Dongarra
The International Journal of High Performance Computing Applications 29 (2 …, 2015
委任: US Department of Energy
Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems
A Haidar, H Bayraktar, S Tomov, J Dongarra, NJ Higham
Proceedings of the Royal Society A 476 (2243), 20200110, 2020
委任: US Department of Energy, UK Engineering and Physical Sciences Research Council
PLASMA: Parallel linear algebra software for multicore using OpenMP
J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, P Wu, I Yamazaki, ...
ACM Transactions on Mathematical Software (TOMS) 45 (2), 1-35, 2019
委任: US National Science Foundation, UK Engineering and Physical Sciences …
High-performance matrix-matrix multiplications of very small matrices
I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ...
Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016
委任: US National Science Foundation, US Department of Energy
A framework for batched and GPU-resident factorization algorithms applied to block householder transformations
A Haidar, TT Dong, S Tomov, P Luszczek, J Dongarra
High Performance Computing: 30th International Conference, ISC High …, 2015
委任: US Department of Energy
Investigating power capping toward energy‐efficient scientific applications
A Haidar, H Jagode, P Vaccaro, A YarKhan, S Tomov, J Dongarra
Concurrency and Computation: Practice and Experience 31 (6), e4485, 2019
委任: US National Science Foundation, US Department of Energy
The design of fast and energy-efficient linear solvers: On the potential of half-precision arithmetic and iterative refinement techniques
A Haidar, A Abdelfattah, M Zounon, P Wu, S Pranesh, S Tomov, ...
International conference on computational science, 586-600, 2018
委任: US National Science Foundation, US Department of Energy
HPC Programming on Intel Many‐Integrated‐Core Hardware with MAGMA Port to Xeon Phi
J Dongarra, M Gates, A Haidar, Y Jia, K Kabir, P Luszczek, S Tomov
Scientific Programming 2015 (1), 502593, 2015
委任: US Department of Energy
With extreme computing, the rules have changed
J Dongarra, S Tomov, P Luszczek, J Kurzak, M Gates, I Yamazaki, H Anzt, ...
Computing in Science & Engineering 19 (3), 52-62, 2017
委任: US National Science Foundation, US Department of Energy
Impacts of multi-gpu mpi collective communications on large fft computation
A Ayala, S Tomov, X Luo, H Shaeik, A Haidar, G Bosilca, J Dongarra
2019 IEEE/ACM Workshop on Exascale MPI (ExaMPI), 12-18, 2019
委任: US Department of Energy
A proposed API for batched basic linear algebra subprograms
J Dongarra, I Duff, M Gates, A Haidar, S Hammarling, NJ Higham, J Hogg, ...
Manchester Institute for Mathematical Sciences, University of Manchester, 2016
委任: US National Science Foundation, US Department of Energy, European Commission
Towards achieving performance portability using directives for accelerators
MG Lopez, VV Larrea, W Joubert, O Hernandez, A Haidar, S Tomov, ...
2016 Third Workshop on Accelerator Programming Using Directives (WACCPD), 13-24, 2016
委任: US National Science Foundation, US Department of Energy
A set of batched basic linear algebra subprograms and LAPACK routines
A Abdelfattah, T Costa, J Dongarra, M Gates, A Haidar, S Hammarling, ...
ACM Transactions on Mathematical Software (TOMS) 47 (3), 1-23, 2021
委任: US National Science Foundation, US Department of Energy, European Commission
A guide for achieving high performance with very small matrices on GPU: a case study of batched LU and Cholesky factorizations
A Haidar, A Abdelfattah, M Zounon, S Tomov, J Dongarra
IEEE Transactions on Parallel and Distributed Systems 29 (5), 973-984, 2017
委任: US National Science Foundation, US Department of Energy
Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs
A Abdelfattah, A Haidar, S Tomov, J Dongarra
Proceedings of the International Conference on Supercomputing, 1-10, 2017
委任: US National Science Foundation, US Department of Energy
公開と助成金に関する情報は、コンピュータ プログラムによって自動的に決定されます