Obserwuj
Pouya Kousha
Pouya Kousha
Senior HPC Software Developer, Nvidia
Zweryfikowany adres z osu.edu
Tytuł
Cytowane przez
Cytowane przez
Rok
Nv-group: link-efficient reduction for distributed deep learning on modern dense gpu systems
CH Chu, P Kousha, AA Awan, KS Khorassani, H Subramoni, DK Panda
Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020
512020
Designing high-performance mpi libraries with on-the-fly compression for modern gpu clusters
Q Zhou, C Chu, NS Kumar, P Kousha, SM Ghazimirsaeed, H Subramoni, ...
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
392021
Salar: Scalable and adaptive designs for large message reduction collectives
M Bayatpour, JM Hashmi, S Chakraborty, H Subramoni, P Kousha, ...
2018 IEEE International Conference on Cluster Computing (CLUSTER), 12-23, 2018
392018
Accelerating mpi all-to-all communication with online compression on modern gpu clusters
Q Zhou, P Kousha, Q Anthony, K Shafie Khorassani, A Shafi, ...
International Conference on High Performance Computing, 3-25, 2022
262022
Efficient asynchronous communication progress for MPI without dedicated resources
A Ruhela, H Subramoni, S Chakraborty, M Bayatpour, P Kousha, ...
Proceedings of the 25th European MPI Users' Group Meeting, 1-11, 2018
262018
Designing a profiling and visualization tool for scalable and in-depth analysis of high-performance GPU clusters
P Kousha, B Ramesh, KK Suresh, CH Chu, A Jain, N Sarkauskas, ...
2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019
242019
Efficient design for MPI asynchronous progress without dedicated resources
A Ruhela, H Subramoni, S Chakraborty, M Bayatpour, P Kousha, ...
Parallel Computing 85, 13-26, 2019
152019
Distmile: a distributed multi-level framework for scalable graph embedding
Y He, S Gurukar, P Kousha, H Subramoni, DK Panda, S Parthasarathy
2021 IEEE 28th International Conference on High Performance Computing, Data …, 2021
122021
Accelerated real-time network monitoring and profiling at scale using OSU INAM
P Kousha, KR SD, H Subramoni, DK Panda, H Na, T Dockendorf, ...
Practice and Experience in Advanced Research Computing 2020: Catch the Wave …, 2020
122020
INAM: cross-stack profiling and analysis of communication in MPI-based applications
P Kousha, KR Sankarapandian Dayala Ganesh Ram, M Kedia, ...
Practice and Experience in Advanced Research Computing 2021: Evolution …, 2021
112021
Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters
A Jain, A Shafi, Q Anthony, P Kousha, H Subramoni, DK Panda
International Conference on High Performance Computing, 109-130, 2022
82022
Democratizing hpc access and use with knowledge graphs
P Kousha, V Sathu, M Lieber, H Subramoni, DK Panda
Proceedings of the SC'23 Workshops of the International Conference on High …, 2023
42023
Sai: Ai-enabled speech assistant interface for science gateways in hpc
P Kousha, A Jain, A Kolli, M Lieber, M Han, N Contini, H Subramoni, ...
International Conference on High Performance Computing, 402-424, 2023
42023
Mpi-xccl: A portable mpi library over collective communication libraries for various accelerators
CC Chen, K Shafie Khorassani, P Kousha, Q Zhou, J Yao, H Subramoni, ...
Proceedings of the SC'23 Workshops of the International Conference on High …, 2023
32023
“Hey CAI”-Conversational AI Enabled User Interface for HPC Tools
P Kousha, A Jain, A Kolli, P Sainath, H Subramoni, A Shafi, DK Panda
International Conference on High Performance Computing, 87-108, 2022
32022
Visualize and Analyze your Network Activities using OSU INAM
H Subramoni, P Kousha, K Ganesh, DK Panda
OpenFabrics Alliance Workshop 2020, 2020
22020
Design and Implementation of an IPC-based Collective MPI Library for Intel GPUs
CC Chen, GKR Kuncham, P Kousha, H Subramoni, DK Panda
Practice and Experience in Advanced Research Computing 2024: Human Powered …, 2024
12024
Designing Conversational AI Enabled Services and Performance Analysis Tools for High-Performance Computing
P Kousha
The Ohio State University, 2024
12024
Benchmarking Modern Databases for Storing and Profiling Very Large Scale HPC Communication Data
P Kousha, Q Zhou, H Subramoni, DK Panda
International Symposium on Benchmarking, Measuring and Optimization, 104-119, 2023
12023
Cross-layer visualization and profiling of network and i/o communication for hpc clusters
P Kousha, Q Anthony, H Subramoni, DK Panda
arXiv preprint arXiv:2109.08329, 2021
12021
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20