Follow
Jan Laukemann
Title
Cited by
Cited by
Year
Automated instruction stream throughput prediction for intel and amd microarchitectures
J Laukemann, J Hammer, J Hofmann, G Hager, G Wellein
2018 IEEE/ACM performance modeling, benchmarking and simulation of high …, 2018
61*2018
Alto: Adaptive linearized storage of sparse tensors
AE Helal, J Laukemann, F Checconi, JJ Tithi, T Ranadive, F Petrini, ...
Proceedings of the ACM International Conference on Supercomputing, 404-416, 2021
332021
Execution‐Cache‐Memory modeling and performance tuning of sparse matrix‐vector multiplication and Lattice quantum chromodynamics on A64FX
C Alappat, N Meyer, J Laukemann, T Gruber, G Hager, G Wellein, ...
Concurrency and Computation: Practice and Experience 34 (20), e6512, 2022
312022
Automatic throughput and critical path analysis of x86 and arm assembly kernels
J Laukemann, J Hammer, G Hager, G Wellein
2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019
282019
Performance modeling of streaming kernels and sparse matrix-vector multiplication on A64FX
C Alappat, J Laukemann, T Gruber, G Hager, G Wellein, N Meyer, ...
2020 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2020
222020
Efficient, out-of-memory sparse MTTKRP on massively parallel architectures
A Nguyen, AE Helal, F Checconi, J Laukemann, JJ Tithi, Y Soh, ...
Proceedings of the 36th ACM International Conference on Supercomputing, 1-13, 2022
112022
CloverLeaf on Intel Multi-Core CPUs: A Case Study in Write-Allocate Evasion
J Laukemann, T Gruber, G Hager, D Oryspayev, G Wellein
2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024
32024
Design and Implementation of a Framework for Predicting Instruction Throughput
J Laukemann
22017
Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams
Y Soh, AE Helal, F Checconi, J Laukemann, JJ Tithi, T Ranadive, F Petrini, ...
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
12023
Core-Level Performance Engineering with the Open-Source Architecture Code Analyzer (OSACA) and the Compiler Explorer
J Laukemann, G Hager
Companion of the 2023 ACM/SPEC International Conference on Performance …, 2023
12023
Cross-Architecture Automatic Critical Path Detection For In-Core Performance Analysis
J Laukemann
Friedrich-Alexander-Universität Erlangen-Nürnberg, 2020
12020
Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and Genoa
J Laukemann, G Hager, G Wellein
arXiv preprint arXiv:2409.08108, 2024
2024
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation
J Laukemann, AE Helal, S Anderson, F Checconi, Y Soh, JJ Tithi, ...
arXiv preprint arXiv:2403.06348, 2024
2024
Exploiting Data Compression and Low Precision for Exascale Fusion Turbulence Simulations
J Laukemann, F Jung, CM Pfeiler, D Jimenez, C Clauss, T Dannert, ...
The International Conference for High Performance Computing, Networking …, 2024
2024
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
RRL Machado, J Eitzinger, J Laukemann, G Hager, H Köstler, G Wellein
Future Generation Computer Systems 149, 25-38, 2023
2023
MD-Bench: Engineering the in-core performance of short-range molecular dynamics kernels from state-of-the-art simulation packages
RRL Machado, J Eitzinger, J Laukemann, G Hager, H Köstler, G Wellein
arXiv preprint arXiv:2302.14660, 2023
2023
MD-Bench: Engineering the in-core performance of short-range molecular dynamics kernels from state-of-the-art simulation packages
R Ravedutti Lucio Machado, J Eitzinger, J Laukemann, G Hager, ...
arXiv e-prints, arXiv: 2302.14660, 2023
2023
Reproducibility report: Team SegFAUlt@ SCC 2016
A Ditter, J Laukemann, B Oehlrich
Parallel Computing 70, 41-45, 2017
2017
PMBS 2019
J Laukemann, J Hammer, G Hager, N Ding, S Williams, J Salmon, ...
The system can't perform the operation now. Try again later.
Articles 1–19