Towards Cross-Platform Performance Portability of DNN Models using SYCL M Goli, K Narasimhan, R Reyes, B Tracy, D Soutar, S Georgiev, ... 2020 IEEE/ACM International Workshop on Performance, Portability and …, 2020 | 17 | 2020 |
A practical tile size selection model for affine loop nests K Narasimhan, A Acharya, A Baid, U Bondhugula Proceedings of the 35th ACM International Conference on Supercomputing, 27-39, 2021 | 11 | 2021 |
Optimizing geometric multigrid method computation using a dsl approach V Vasista, K Narasimhan, S Bhat, U Bondhugula Proceedings of the International Conference for High Performance Computing …, 2017 | 7 | 2017 |
Towards performance portability of ai models using sycl-dnn M Tanvir, K Narasimhan, M Goli, O El Farouki, S Georgiev, I Ault Proceedings of the 10th International Workshop on OpenCL, 1-3, 2022 | 6 | 2022 |
Improving performance of SYCL applications on CPU architectures using LLVM-directed compilation flow P Ghiglio, U Dolinsky, M Goli, K Narasimhan Proceedings of the Thirteenth International Workshop on Programming Models …, 2022 | 5 | 2022 |
User-Driven Online Kernel Fusion for SYCL V Perez, L Sommer, V Lomüller, K Narasimhan, M Goli ACM Transactions on Architecture and Code Optimization, 2023 | 3 | 2023 |
Accelerating Neural Networks Using Open Standard Software on RISC-V K Narasimhan, M Goli International Conference on High Performance Computing, 552-564, 2023 | 2 | 2023 |
Towards performance portability of AI graphs using SYCL K Narasimhan, O El Farouki, M Goli, M Tanvir, S Georgiev, I Ault 2022 IEEE/ACM International Workshop on Performance, Portability and …, 2022 | 2 | 2022 |
Programming Model Extensions for General-Purpose Processing-In-Memory H Hong, L Sommer, B Kim, M Kashkarov, K Narasimhan, I Veselov, M Goli, ... ISC High Performance 2024 Research Paper Proceedings (39th International …, 2024 | | 2024 |
A Performance Analysis of Leading Many-Core Technologies for Cellular Automata Execution A De Rango, D D’Ambrosio, A Senatore, G Mendicino, K Narasimhan, ... European Conference on Parallel Processing, 270-281, 2023 | | 2023 |
Vetter, Jeffrey 45 T Ben-Nun, E Chereshnev, T Deakin, W Elwasif, EM Fomenko, T Gamblin, ... | | |