In-datacenter performance analysis of a tensor processing unit NP Jouppi, C Young, N Patil, D Patterson, G Agrawal, R Bajwa, S Bates, ... Proceedings of the 44th annual international symposium on computer …, 2017 | 5744 | 2017 |
Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations J Mars, L Tang, R Hundt, K Skadron, ML Soffa Proceedings of the 44th annual IEEE/ACM International Symposium on …, 2011 | 785 | 2011 |
The impact of memory subsystem resource sharing on datacenter applications L Tang, J Mars, N Vachharajani, R Hundt, ML Soffa ACM SIGARCH Computer Architecture News 39 (3), 283-294, 2011 | 282 | 2011 |
Google-wide profiling: A continuous profiling infrastructure for data centers G Ren, E Tune, T Moseley, Y Shi, S Rus, R Hundt IEEE micro 30 (4), 65-79, 2010 | 254 | 2010 |
Contention aware execution: online contention detection and response J Mars, N Vachharajani, R Hundt, ML Soffa Proceedings of the 8th annual IEEE/ACM international symposium on Code …, 2010 | 143 | 2010 |
Dynamic instrumentation of an executable program by means of causing a breakpoint at the entry point of a function and providing instrumentation code R Hundt, V Ramasamy, E Gouriou, DJ Babcock, TC Lofgren, JG Rivera, ... US Patent 6,918,110, 2005 | 139 | 2005 |
Loop recognition in c++/java/go/scala R Hundt Proceedings of Scala Days 2011 (86), 2, 2011 | 134 | 2011 |
Heterogeneity in “homogeneous” warehouse-scale computers: A performance opportunity J Mars, L Tang, R Hundt IEEE Computer Architecture Letters 10 (2), 29-32, 2011 | 120 | 2011 |
Optimizing Google's warehouse scale computers: The NUMA experience L Tang, J Mars, X Zhang, R Hagmann, R Hundt, E Tune 2013 IEEE 19th International Symposium on High Performance Computer …, 2013 | 97 | 2013 |
gpucc: an open-source GPGPU compiler J Wu, A Belevich, E Bendersky, M Heffernan, C Leary, J Pienaar, B Roune, ... Proceedings of the 2016 International Symposium on Code Generation and …, 2016 | 89 | 2016 |
Taming hardware event samples for FDO compilation D Chen, N Vachharajani, R Hundt, S Liao, V Ramasamy, P Yuan, W Chen, ... Proceedings of the 8th annual IEEE/ACM international symposium on Code …, 2010 | 89 | 2010 |
RACEZ: A lightweight and non-invasive race detection tool for production applications T Sheng, N Vachharajani, S Eranian, R Hundt, W Chen, W Zheng Proceedings of the 33rd International Conference on Software Engineering …, 2011 | 80 | 2011 |
System and method for processing breakpoint events in a child process generated by a parent process E Gouriou, R Hundt, S Saraswati US Patent 7,185,320, 2007 | 79 | 2007 |
Scenario based optimization: A framework for statically enabling online optimizations J Mars, R Hundt 2009 International Symposium on Code Generation and Optimization, 169-179, 2009 | 63 | 2009 |
Practical structure layout optimization and advice R Hundt, S Mannarswamy, D Chakrabarti International Symposium on Code Generation and Optimization (CGO'06), 12 pp.-244, 2006 | 59 | 2006 |
Mao—An extensible micro-architectural optimizer R Hundt, E Raman, M Thuresson, N Vachharajani International Symposium on Code Generation and Optimization (CGO 2011), 1-10, 2011 | 54 | 2011 |
Increasing utilization in modern warehouse-scale computers using bubble-up J Mars, L Tang, K Skadron, ML Soffa, R Hundt IEEE Micro 32 (3), 88-99, 2012 | 53 | 2012 |
Taming hardware event samples for precise and versatile feedback directed optimizations D Chen, N Vachharajani, R Hundt, X Li, S Eranian, W Chen, W Zheng IEEE Transactions on Computers 62 (2), 376-389, 2011 | 50 | 2011 |
Augmenting debuggers R Hundt US Patent App. 09/846,222, 2004 | 49 | 2004 |
Unwinding instrumented program code R Hundt, V Ramasamy US Patent 7,131,115, 2006 | 48* | 2006 |