Khaled Hamidouche

인용

	전체	2019년 이후
서지정보	1442	787
h-index	21	16
i10-index	40	22

200

100

150

2011201220132014201520162017201820192020202120222023202418 9 18 73 91 99 131 181 157 178 135 109 105 103

공개 액세스

모두 보기

자료 16개

자료 3개

공개

비공개

재정 지원 요구사항 기준

팔로우

Khaled Hamidouche

AMD Research

amd.com의 이메일 확인됨

senior Research Scientist


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Efficient inter-node MPI communication using GPUDirect RDMA for InfiniBand clusters with NVIDIA GPUs S Potluri, K Hamidouche, A Venkatesh, D Bureddy, DK Panda 2013 42nd International Conference on Parallel Processing, 80-89, 2013	187	2013
S-caffe: Co-designing mpi runtimes and caffe for scalable deep learning on modern gpu clusters AA Awan, K Hamidouche, JM Hashmi, DK Panda Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of …, 2017	178	2017
Efficient large message broadcast using NCCL and CUDA-aware MPI for deep learning AA Awan, K Hamidouche, A Venkatesh, DK Panda Proceedings of the 23rd European MPI Users' Group Meeting, 15-22, 2016	59	2016
Designing efficient small message transfer mechanism for inter-node MPI communication on InfiniBand GPU clusters R Shi, S Potluri, K Hamidouche, J Perkins, M Li, D Rossetti, DKDK Panda 2014 21st International Conference on High Performance Computing (HiPC), 1-10, 2014	58	2014
MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters S Potluri, D Bureddy, K Hamidouche, A Venkatesh, K Kandalla, ... Proceedings of the International Conference on High Performance Computing …, 2013	52	2013
A case for application-oblivious energy-efficient MPI runtime A Venkatesh, A Vishnu, K Hamidouche, N Tallent, D Panda, D Kerbyson, ... Proceedings of the international conference for high performance computing …, 2015	45	2015
Designing MPI library with dynamic connected transport (DCT) of InfiniBand: early experiences H Subramoni, K Hamidouche, A Venkatesh, S Chakraborty, DK Panda International Supercomputing Conference, 278-295, 2014	40	2014
Hand: A hybrid approach to accelerate non-contiguous data movement using mpi datatypes on gpu clusters R Shi, X Lu, S Potluri, K Hamidouche, J Zhang, DK Panda 2014 43rd International Conference on Parallel Processing, 221-230, 2014	34	2014
Designing optimized mpi broadcast and allreduce for many integrated core (mic) infiniband clusters K Kandalla, A Venkatesh, K Hamidouche, S Potluri, D Bureddy, DK Panda 2013 IEEE 21st Annual Symposium on High-Performance Interconnects, 63-70, 2013	33	2013
Power-check: An energy-efficient checkpointing framework for HPC clusters RR Chandrasekar, A Venkatesh, K Hamidouche, DK Panda 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2015	32	2015
Exploiting GPUDirect RDMA in designing high performance OpenSHMEM for NVIDIA GPU clusters K Hamidouche, A Venkatesh, AA Awan, H Subramoni, CH Chu, ... 2015 IEEE International Conference on Cluster Computing, 78-87, 2015	30	2015
Cuda kernel based collective reduction operations on large-scale gpu clusters CH Chu, K Hamidouche, A Venkatesh, AA Awan, DK Panda 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2016	29	2016
A scalable and portable approach to accelerate hybrid HPL on heterogeneous CPU-GPU clusters R Shi, S Potluri, K Hamidouche, X Lu, K Tomko, DK Panda 2013 IEEE International Conference on Cluster Computing (CLUSTER), 1-8, 2013	29	2013
INAM²: InfiniBand Network Analysis and Monitoring with MPI H Subramoni, AM Augustine, M Arnold, J Perkins, X Lu, K Hamidouche, ... International Conference on High Performance Computing, 300-320, 2016	28	2016
Scalable Graph500 design with MPI-3 RMA M Li, X Lu, S Potluri, K Hamidouche, J Jose, K Tomko, DK Panda 2014 IEEE International Conference on Cluster Computing (CLUSTER), 230-238, 2014	28	2014
Re-designing CNTK deep learning framework on modern GPU enabled clusters DS Banerjee, K Hamidouche, DK Panda 2016 IEEE international conference on cloud computing technology and science …, 2016	27	2016
GPU triggered networking for intra-kernel communications M LeBeane, K Hamidouche, B Benton, M Breternitz, SK Reinhardt, ... Proceedings of the International Conference for High Performance Computing …, 2017	26	2017
A framework for an automatic hybrid MPI+ OpenMP code generation. K Hamidouche, J Falcou, D Etiemble SpringSim (hpc), 48-55, 2011	26	2011
Parallel smith-waterman comparison on multicore and manycore computing platforms with BSP++ K Hamidouche, FM Mendonca, J Falcou, ACMA de Melo, D Etiemble International Journal of Parallel Programming 41, 111-136, 2013	24	2013
High performance MPI datatype support with user-mode memory registration: Challenges, designs, and benefits M Li, H Subramoni, K Hamidouche, X Lu, DK Panda 2015 IEEE International Conference on Cluster Computing, 226-235, 2015	23	2015

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용