Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with scatter and gather K Kandalla, H Subramoni, A Vishnu, DK Panda 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 110 | 2010 |
Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes H Subramoni, S Potluri, K Kandalla, B Barth, J Vienne, J Keasler, ... SC'12: Proceedings of the International Conference on High Performance …, 2012 | 86 | 2012 |
SR-IOV support for virtualization on InfiniBand clusters: Early experience J Jose, M Li, X Lu, KC Kandalla, MD Arnold, DK Panda 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid …, 2013 | 82 | 2013 |
Scalable memcached design for InfiniBand clusters using hybrid transports J Jose, H Subramoni, K Kandalla, M Wasi-ur-Rahman, H Wang, ... 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2012 | 81 | 2012 |
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT K Kandalla, H Subramoni, K Tomko, D Pekurovsky, S Sur, DK Panda Computer Science-Research and Development 26 (3), 237-246, 2011 | 77 | 2011 |
Designing multi-leader-based allgather algorithms for multi-core clusters K Kandalla, H Subramoni, G Santhanaraman, M Koop, DK Panda 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-8, 2009 | 67 | 2009 |
Design and evaluation of network topology-/speed-aware broadcast algorithms for InfiniBand clusters H Subramoni, K Kandalla, J Vienne, S Sur, B Barth, K Tomko, R Mclay, ... 2011 IEEE International Conference on Cluster Computing, 317-325, 2011 | 57 | 2011 |
Designing power-aware collective communication algorithms for InfiniBand clusters K Kandalla, EP Mancini, S Sur, DK Panda 2010 39th International Conference on Parallel Processing, 218-227, 2010 | 57 | 2010 |
GPCNeT: Designing a benchmark suite for inducing and measuring contention in HPC networks S Chunduri, T Groves, P Mendygral, B Austin, J Balma, K Kandalla, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 54 | 2019 |
MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters S Potluri, D Bureddy, K Hamidouche, A Venkatesh, K Kandalla, ... Proceedings of the International Conference on High Performance Computing …, 2013 | 52 | 2013 |
Efficient intra-node communication on Intel-MIC clusters S Potluri, A Venkatesh, D Bureddy, K Kandalla, DK Panda 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid …, 2013 | 49 | 2013 |
Designing non-blocking allreduce with collective offload on InfiniBand clusters: A case study with conjugate gradient solvers K Kandalla, U Yang, J Keasler, T Kolev, A Moody, H Subramoni, K Tomko, ... 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 41 | 2012 |
WOMBAT: A scalable and high-performance astrophysical magnetohydrodynamics code PJ Mendygral, N Radcliffe, K Kandalla, D Porter, BJ O’Neill, C Nolting, ... The Astrophysical Journal Supplement Series 228 (2), 23, 2017 | 39 | 2017 |
Supporting hybrid MPI and OpenSHMEM over InfiniBand: Design and performance evaluation J Jose, K Kandalla, M Luo, DK Panda 2012 41st International Conference on Parallel Processing, 219-228, 2012 | 39 | 2012 |
Design and evaluation of generalized collective communication primitives with overlap using ConnectX-2 offload engine H Subramoni, K Kandalla, S Sur, DK Panda 2010 18th IEEE Symposium on High Performance Interconnects, 40-49, 2010 | 36 | 2010 |
Designing optimized MPI broadcast and allreduce for Many Integrated Core (MIC) InfiniBand clusters K Kandalla, A Venkatesh, K Hamidouche, S Potluri, D Bureddy, DK Panda 2013 IEEE 21st Annual Symposium on High-Performance Interconnects, 63-70, 2013 | 33 | 2013 |
MPI alltoall personalized exchange on GPGPU clusters: Design alternatives and benefit AK Singh, S Potluri, H Wang, K Kandalla, S Sur, DK Panda 2011 IEEE International Conference on Cluster Computing, 420-427, 2011 | 31 | 2011 |
Evaluating the networking characteristics of the Cray XC‐40 Intel Knights Landing‐based Cori supercomputer at NERSC D Doerfler, B Austin, B Cook, J Deslippe, K Kandalla, P Mendygral Concurrency and Computation: Practice and Experience 30 (1), e4297, 2018 | 29 | 2018 |
Evaluation of energy characteristics of MPI communication primitives with RAPL A Venkatesh, K Kandalla, DK Panda 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013 | 27 | 2013 |
Designing non-blocking broadcast with collective offload on InfiniBand clusters: A case study with HPL K Kandalla, H Subramoni, J Vienne, SP Raikar, K Tomko, S Sur, ... 2011 IEEE 19th Annual Symposium on High Performance Interconnects, 27-34, 2011 | 26 | 2011 |