An overview of the BlueGene/L supercomputer NR Adiga, G Almási, GS Almasi, Y Aridor, R Barik, D Beece, R Bellofatto, ... SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 60-60, 2002 | 685 | 2002 |
Bluegene/l failure analysis and prediction models Y Liang, Y Zhang, A Sivasubramaniam, M Jette, R Sahoo International Conference on Dependable Systems and Networks (DSN'06), 425-434, 2006 | 404 | 2006 |
Critical event prediction for proactive management in large-scale computer clusters RK Sahoo, AJ Oliner, I Rish, M Gupta, JE Moreira, S Ma, R Vilalta, ... Proceedings of the ninth ACM SIGKDD international conference on Knowledge …, 2003 | 388 | 2003 |
Failure prediction in ibm bluegene/l event logs Y Liang, Y Zhang, H Xiong, R Sahoo Seventh IEEE International Conference on Data Mining (ICDM 2007), 583-588, 2007 | 357 | 2007 |
Failure data analysis of a large-scale heterogeneous server environment RK Sahoo, MS Squillante, A Sivasubramaniam, Y Zhang International Conference on Dependable Systems and Networks, 2004, 772-781, 2004 | 339 | 2004 |
Filtering failure logs for a bluegene/l prototype Y Liang, Y Zhang, A Sivasubramaniam, RK Sahoo, J Moreira, M Gupta 2005 International Conference on Dependable Systems and Networks (DSN'05 …, 2005 | 201 | 2005 |
Performance implications of failures in large-scale cluster scheduling Y Zhang, MS Squillante, A Sivasubramaniam, RK Sahoo Job Scheduling Strategies for Parallel Processing: 10th International …, 2005 | 179 | 2005 |
Fault-aware job scheduling for bluegene/l systems AJ Oliner, RK Sahoo, JE Moreira, M Gupta, A Sivasubramaniam 18th International Parallel and Distributed Processing Symposium, 2004 …, 2004 | 125 | 2004 |
Cooperative checkpointing: A robust approach to large-scale systems reliability AJ Oliner, L Rudolph, RK Sahoo Proceedings of the 20th annual international conference on Supercomputing, 14-23, 2006 | 109 | 2006 |
An overview of the Blue Gene/L system software organization G Almási, R Bellofatto, J Brunheroto, C Caşcaval, JG Castanos, L Ceze, ... Euro-Par 2003 Parallel Processing: 9th International Euro-Par Conference …, 2003 | 91 | 2003 |
High performance file I/O for the Blue Gene/L supercomputer H Yu, RK Sahoo, C Howson, G Almasi, JG Castanos, M Gupta, JE Moreira, ... The Twelfth International Symposium on High-Performance Computer …, 2006 | 90 | 2006 |
Performance implications of periodic checkpointing on large-scale cluster systems AJ Oliner, RK Sahoo, JE Moreira, M Gupta 19th IEEE International Parallel and Distributed Processing Symposium, 8 pp., 2005 | 85 | 2005 |
Hybrid method for event prediction and system control M Gupta, JE Moreira, AJ Oliner, RK Sahoo US Patent 7,451,210, 2008 | 80 | 2008 |
Blue Gene/L programming and operating environment JE Moreira, G Almási, C Archer, R Bellofatto, P Bergner, JR Brunheroto, ... IBM journal of Research and Development 49 (2.3), 367-376, 2005 | 68 | 2005 |
MemorIES3: a programmable, real-time hardware emulation tool for multiprocessor server design A Nanda, KK Mak, K Sugarvanam, RK Sahoo, V Soundarararjan, ... ACM SIGARCH Computer Architecture News 28 (5), 37-48, 2000 | 61 | 2000 |
Scalable method of continuous monitoring the remotely accessible resources against the node failures for very large clusters MM Bae, RK Sahoo US Patent 7,137,040, 2006 | 56 | 2006 |
Method for using a priority queue to perform job scheduling on a cluster based on node rank and performance RK Sahoo, AJ Oliner US Patent 7,827,435, 2010 | 49 | 2010 |
Towards an integrated approach for analysis and design of wafer slicing by a wire saw RK Sahoo, V Prasad, I Kao, J Talbott, KP Gupta | 45 | 1998 |
An adaptive semantic filter for blue gene/l failure log analysis Y Liang, Y Zhang, H Xiong, R Sahoo 2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007 | 44 | 2007 |
Method for extracting signature from problem records through unstructured and structured text mapping, classification and ranking RB Jennings III, H Huang, Y Ruan, D Saba, RK Sahoo, S Sahu, ... US Patent 8,260,773, 2012 | 41 | 2012 |