关注
João P. L. de Carvalho
João P. L. de Carvalho
在 amd.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
KernelFaRer: replacing native-code idioms with high-performance library calls
JPL De Carvalho, B Kuzma, I Korostelev, JN Amaral, C Barton, J Moreira, ...
ACM Transactions On Architecture And Code Optimization (TACO) 18 (3), 1-22, 2021
252021
Energy-performance tradeoffs in software transactional memory
A Baldassin, JPL De Carvalho, LAG Garcia, R Azevedo
2012 IEEE 24th International Symposium on Computer Architecture and High …, 2012
202012
The case for phase-based transactional memory
JPL de Carvalho, G Araujo, A Baldassin
IEEE Transactions on Parallel and Distributed Systems 30 (2), 459-472, 2018
122018
Revisiting phased transactional memory
JPL De Carvalho, G Araujo, A Baldassin
Proceedings of the International Conference on Supercomputing, 1-10, 2017
92017
To pack or not to pack: A generalized packing analysis and transformation
C Salvador Rohwedder, N Henderson, JPL De Carvalho, Y Chen, ...
Proceedings of the 21st ACM/IEEE International Symposium on Code Generation …, 2023
72023
Pooling acceleration in the DaVinci architecture using Im2col and Col2im instructions
CS Rohwedder, JPL de Carvalho, JN Amaral, G Araújo, G Colmenares, ...
2021 IEEE International Parallel and Distributed Processing Symposium …, 2021
72021
On the Efficiency of Transactional Code Generation: A GCC Case Study
BC Honorio, JPL Carvalho, A Baldassin
Simpósio de Sistemas Computacionais de Alto Desempenho (WSCAD), 2018
62018
Fast matrix multiplication via compiler‐only layered data reorganization and intrinsic lowering
B Kuzma, I Korostelev, JPL De Carvalho, JE Moreira, C Barton, G Araujo, ...
Software: Practice and Experience 53 (9), 1793-1814, 2023
52023
YaConv: Convolution with low cache footprint
I Korostelev, JP L. De Carvalho, J Moreira, JN Amaral
ACM Transactions on Architecture and Code Optimization 20 (1), 1-18, 2023
52023
Compiling for the IBM Matrix Engine for Enterprise Workloads
JPL de Carvalho, JE Moreira, JN Amaral
IEEE MICRO, 1-8, 2022
52022
NV-PhTM: An Efficient Phase-Based Transactional System for Non-Volatile Memory
A Baldassin, RP Murari, JPL Carvalho, G Araujo, D Castro, J Barreto, ...
Euro-Par: 26th International European Conference on Parallel and Distributed …, 2020
52020
An Efficient Parallel Implementation for Training Supervised Optimum-Path Forest Classifiers
A Culquicondor, A Baldassin, C Castelo-Fernandéz, JPL Carvalho, ...
Neurocomputing, 2018
52018
Improving transactional code generation via variable annotation and barrier elision
JPL De Carvalho, BC Honorio, A Baldassin, G Araujo
2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020
42020
Advancing direct convolution using convolution slicing optimization and isa extensions
V Ferrari, R Sousa, M Pereira, JP L. De Carvalho, JN Amaral, J Moreira, ...
ACM Transactions on Architecture and Code Optimization 20 (4), 1-26, 2023
32023
Reavaliando a Eficiência Energética de Memória Transacional em Processadores Convencionais
JPL de Carvalho, A Baldassin, R Azevedo
Simpósio em Sistemas Computacionais de Alto Desempenho (SSCAD), 69-76, 2013
32013
Improving convolution via cache hierarchy tiling and reduced packing
V Ferrari, R Sousa, M Pereira, JPL de Carvalho, JN Amaral, G Araujo
Proceedings of the International Conference on Parallel Architectures and …, 2022
22022
Vectorizing divergent control fow with active‑lane consolidation on long‑vector architectures
W Praharenka, D Pankratz, JPL de Carvalho, E Amiri, JN Amaral
The Journal of Supercomputing, 2022
22022
Accelerating graph applications using phased transactional memory
CM Morales, R Murari, JPL de Carvalho, BC Honorio, A Baldassin, ...
Euro-Par 2021: Parallel Processing: 27th International Conference on …, 2021
22021
Acceleration opportunities in linear algebra applications via idiom recognition
JP L. de Carvalho, B Kuzma, G Araujo
Companion of the ACM/SPEC International Conference on Performance …, 2020
22020
DOACROSS Parallelization Based on Component Annotation and Loop-carried Probability
L Mattos, D Cesar, J Salamanca, JPL de Carvalho, M Pereira, G Araujo
30th International Symposium on Computer Architecture and High Performance …, 2018
22018
系统目前无法执行此操作,请稍后再试。
文章 1–20