Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution J Doerfert, A Patel, J Huber, S Tian, JMM Diaz, B Chapman, ... 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 25 | 2022 |
A Virtual GPU as Developer-Friendly OpenMP Offload Target A Patel, S Tian, J Doerfert, B Chapman 50th International Conference on Parallel Processing Workshop, 1-7, 2021 | 20 | 2021 |
Remote OpenMP offloading A Patel, J Doerfert Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022 | 19 | 2022 |
EMISSARY: Enhanced Miss Awareness Replacement Policy for L2 Instruction Caching NP Nagendra, BR Godala, I Chaturvedi, A Patel, S Kanev, T Moseley, ... Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 7 | 2023 |
Representing Data Collections in an SSA Form T McMichen, N Greiner, P Zhong, F Sossai, A Patel, S Campanoni 2024 IEEE/ACM International Symposium on Code Generation and Optimization …, 2024 | 2 | 2024 |
The Parallel Semantics Program Dependence Graph B Homerding, A Patel, EA Deiana, Y Su, Z Tan, Z Xu, BR Godala, ... arXiv preprint arXiv:2402.00986, 2024 | | 2024 |