A practical automatic polyhedral parallelizer and locality optimizer U Bondhugula, A Hartono, J Ramanujam, P Sadayappan Proceedings of the 29th ACM SIGPLAN Conference on Programming Language …, 2008 | 1555 | 2008 |
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model U Bondhugula, M Baskaran, S Krishnamoorthy, J Ramanujam, A Rountev, ... International Conference on Compiler Construction, 132-146, 2008 | 458 | 2008 |
Dynamic management of scratch-pad memory space M Kandemir, J Ramanujam, MJ Irwin, N Vijaykrishnan, I Kadayif, A Parikh Design Automation Conference, 2001. Proceedings, 690-695, 2001 | 401 | 2001 |
Automatic C-to-CUDA code generation for affine programs MM Baskaran, J Ramanujam, P Sadayappan Compiler Construction: 19th International Conference, CC 2010, Held as Part …, 2010 | 341 | 2010 |
Effective automatic parallelization of stencil computations S Krishnamoorthy, M Baskaran, U Bondhugula, J Ramanujam, A Rountev, ... ACM SIGPLAN Notices 42 (6), 235-244, 2007 | 324 | 2007 |
A compiler framework for optimization of affine loop nests for GPGPUs MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 22nd annual international conference on Supercomputing …, 2008 | 301 | 2008 |
Compile-time techniques for data distribution in distributed memory machines J Ramanujam, P Sadayappan IEEE Transactions on Parallel and Distributed Systems 2 (4), 472-482, 1991 | 262 | 1991 |
Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models G Baumgartner, A Auer, DE Bernholdt, A Bibireata, V Choppella, ... Proceedings of the IEEE 93 (2), 276-292, 2005 | 258 | 2005 |
Tiling multidimensional iteration spaces for multicomputers J Ramanujam, P Sadayappan Journal of Parallel and Distributed Computing 16 (2), 108-120, 1992 | 227 | 1992 |
Task allocation onto a hypercube by recursive mincut bipartitioning F Ercal, J Ramanujam, P Sadayappan Proceedings of the third conference on Hypercube concurrent computers and …, 1988 | 224 | 1988 |
Loop transformations: convexity, pruning and optimization LN Pouchet, U Bondhugula, C Bastoul, A Cohen, J Ramanujam, ... ACM SIGPLAN Notices 46 (1), 549-562, 2011 | 187 | 2011 |
A stencil compiler for short-vector SIMD architectures T Henretty, R Veras, F Franchetti, LN Pouchet, J Ramanujam, ... Proceedings of the 27th international ACM conference on International …, 2013 | 180 | 2013 |
Cluster partitioning approaches to mapping parallel programs onto a hypercube P Sadayappan, F Ercal, J Ramanujam Parallel Computing 13 (1), 1-16, 1990 | 177 | 1990 |
Automatic code generation for many-body electronic structure methods: the tensor contraction engine AA Auer, G Baumgartner, DE Bernholdt, A Bibireata, V Choppella, ... Molecular Physics 104 (2), 211-228, 2006 | 174 | 2006 |
Data layout transformation for stencil computations on short-vector SIMD architectures T Henretty, K Stock, LN Pouchet, F Franchetti, J Ramanujam, ... Compiler Construction: 20th International Conference, CC 2011, Held as Part …, 2011 | 173 | 2011 |
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories MM Baskaran, U Bondhugula, S Krishnamoorthy, J Ramanujam, ... Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008 | 162 | 2008 |
Improving cache locality by a combination of loop and data transformations M Kandemir, J Ramanujam, A Choudhary IEEE Transactions on Computers 48 (2), 159-167, 1999 | 151 | 1999 |
Improving locality using loop and data transformations in an integrated framework M Kandemir, A Choudhary, J Ramanujam, P Banerjee Proceedings of the 31st annual ACM/IEEE international symposium on …, 1998 | 141 | 1998 |
Exploiting shared scratch pad memory space in embedded multiprocessor systems M Kandemir, J Ramanujam, A Choudhary Proceedings of the 39th annual Design Automation Conference, 219-224, 2002 | 128 | 2002 |
Split tiling for GPUs: automatic parallelization using trapezoidal tiles T Grosser, A Cohen, PHJ Kelly, J Ramanujam, P Sadayappan, ... Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013 | 125 | 2013 |