Jiayuan Meng
Title
Cited by
Year
Rodinia: A benchmark suite for heterogeneous computing
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, SH Lee, K Skadron
2009 IEEE international symposium on workload characterization (IISWC), 44-54, 2009
3872 2009
A performance study of general-purpose applications on graphics processors using CUDA
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, K Skadron
Journal of parallel and distributed computing 68 (10), 1370-1380, 2008
937 2008
Dynamic warp subdivision for integrated branch and memory divergence tolerance
J Meng, D Tarjan, K Skadron
Proceedings of the 37th annual international symposium on Computer …, 2010
378 2010
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs
J Meng, K Skadron
Proceedings of the 23rd international conference on Supercomputing, 256-265, 2009
201 2009
GROPHECY: GPU performance projection from CPU code skeletons
J Meng, VA Morozov, K Kumaran, V Vishwanath, TD Uram
Proceedings of 2011 International Conference for High Performance Computing …, 2011
135 2011
Best-effort parallel execution framework for recognition and mining applications
J Meng, S Chakradhar, A Raghunathan
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009
134 2009
Improving GPU performance prediction with data transfer modeling
M Boyer, J Meng, K Kumaran
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
79 2013
Increasing memory miss tolerance for SIMD cores
D Tarjan, J Meng, K Skadron
Proceedings of the Conference on High Performance Computing Networking …, 2009
77 2009
Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling
J Meng, K Skadron
2009 IEEE international conference on computer design, 282-288, 2009
72 2009
A performance study for iterative stencil loops on GPUs with ghost zone optimizations
J Meng, K Skadron
International Journal of Parallel Programming 39, 115-142, 2011
67 2011
Exploiting the forgiving nature of applications for scalable parallel execution
J Meng, A Raghunathan, S Chakradhar, S Byna
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
57 2010
Workflow performance improvement using model-based scheduling over multiple clusters and clouds
K Maheshwari, ES Jung, J Meng, V Morozov, V Vishwanath, R Kettimuthu
Future Generation Computer Systems 54, 206-218, 2016
41 2016
Systems and methods for implementing best-effort parallel computing frameworks
S Chakradhar, A Raghunathan, J Meng
US Patent 8,286,172, 2012
38 2012
Exploiting inter-thread temporal locality for chip multithreading
J Meng, JW Sheaffer, K Skadron
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
38 2010
Best-effort semantic document search on GPUs
S Byna, J Meng, A Raghunathan, S Chakradhar, S Cadambi
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010
37 2010
Dataflow-driven GPU performance projection for multi-kernel transformations
J Meng, VA Morozov, V Vishwanath, K Kumaran
SC'12: Proceedings of the International Conference on High Performance …, 2012
31 2012
Skope: A framework for modeling and exploring workload behavior
J Meng, X Wu, V Morozov, V Vishwanath, K Kumaran, V Taylor
Proceedings of the 11th ACM Conference on Computing Frontiers, 1-10, 2014
30 2014
Dynamic warp subdivision for integrated branch and memory latency divergence tolerance
K Skadron, J Meng, D Tarjan
US Patent App. 13/040,045, 2011
29 2011
Robust SIMD: Dynamically adapted SIMD width and multi-threading depth
J Meng, JW Sheaffer, K Skadron
2012 IEEE 26th international parallel and distributed processing symposium …, 2012
24 2012
A multiple SIMD, multiple data (MSMD) architecture: Parallel execution of dynamic and static SIMD fragments
Y Wang, S Chen, J Wan, J Meng, K Zhang, W Liu, X Ning
2013 IEEE 19th International Symposium on High Performance Computer …, 2013
21 2013
Articles 1–20