Ang Li

Cited by

	All	Since 2019
Citations	4083	3833
h-index	36	35
i10-index	78	77

1300

650

325

975

20162017201820192020202120222023202438 56 140 204 263 480 640 955 1271

Public access

View all

94 articles

10 articles

available

not available

Based on funding mandates

Co-authors

Tony (Tong) GengAssistant Professor, University of RochesterVerified email at rochester.edu
Samuel SteinPacific Northwest National LaboratoryVerified email at pnnl.gov
Chunshu WuPostdoctoral Research Fellow at University of Rochester, ECE DepartmentVerified email at ur.rochester.edu
Martin HerbordtProfessor, Electrical and Computer Engineering, Boston UniversityVerified email at bu.edu
Shuaiwen Leon SongVP of Research, Together.ai; Ex-Microsoft; Tenured ProfessorVerified email at together.ai
Cheng TanGoogle, Arizona State UniversityVerified email at google.com
Antonino TumeoPacific Northwest National LaboratoryVerified email at pnnl.gov
Yufei DingUniversity of California, San DiegoVerified email at ucsd.edu
Akash KumarFull Professor, Chair of Embedded Systems, Ruhr University BochumVerified email at rub.de
Henk CorporaalProfessor Embedded System Architectures, Eindhoven University of TechnologyVerified email at tue.nl
Qiang GuanKent State UniversityVerified email at cs.kent.edu
Ying MaoAssociate Professor, Fordham UniversityVerified email at cis.fordham.edu
Caiwen DingAssociate Professor, University of Minnesota Twin CitiesVerified email at umn.edu
Kevin J. BarkerHigh Performance Computing Group Lead, Pacific Northwest National LaboratoryVerified email at pnnl.gov
Jiajia LiNorth Carolina State UniversityVerified email at ncsu.edu
Yanfei LiPostdoc, Pacific Northwest National LaboratoryVerified email at pnnl.gov
Hongwu PengPh.D. Student, University of ConnecticutVerified email at uconn.edu
Yingyan (Celine) LinAssociate Professor, Georgia Institute of TechnologyVerified email at gatech.edu
Anqi GuoBoston UniversityVerified email at bu.edu
Sriram KrishnamoorthyGoogleVerified email at google.com

Ang Li

Pacific Northwest National Laboratory and University of Washington

Verified email at pnnl.gov - Homepage

GPU High Performance Computing Quantum Computing Computer Architecture


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Superneurons: Dynamic GPU memory management for training deep neural networks L Wang, J Ye, Y Zhao, W Wu, A Li, SL Song, Z Xu, T Kraska Proceedings of the 23rd ACM SIGPLAN symposium on principles and practice of …, 2018	311	2018
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020	302	2020
Evaluating modern gpu interconnect: Pcie, nvlink, nv-sli, nvswitch and gpudirect A Li, SL Song, J Chen, J Li, X Liu, NR Tallent, KJ Barker IEEE Transactions on Parallel and Distributed Systems 31 (1), 94-110, 2019	279	2019
Qasmbench: A low-level quantum benchmark suite for nisq evaluation and simulation A Li, S Stein, S Krishnamoorthy, J Ang ACM Transactions on Quantum Computing 4 (2), 1-26, 2023	187*	2023
I-GCN: A graph convolutional network accelerator with runtime locality enhancement through islandization T Geng, C Wu, Y Zhang, C Tan, C Xie, H You, M Herbordt, Y Lin, A Li MICRO-54: 54th annual IEEE/ACM international symposium on microarchitecture …, 2021	116	2021
A synchronization-free algorithm for parallel sparse triangular solves W Liu, A Li, J Hogg, IS Duff, B Vinter Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016	113	2016
Accelerating transformer-based deep learning models on fpgas using column balanced block pruning H Peng, S Huang, T Geng, A Li, W Jiang, H Liu, S Wang, C Ding 2021 22nd International Symposium on Quality Electronic Design (ISQED), 142-148, 2021	100	2021
Adaptive and transparent cache bypassing for GPUs A Li, GJ van den Braak, A Kumar, H Corporaal Proceedings of the International Conference for High Performance Computing …, 2015	95	2015
Locality-aware CTA clustering for modern GPUs A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal ACM SIGARCH Computer Architecture News 45 (1), 297-311, 2017	94	2017
Qugan: A quantum state fidelity based generative adversarial network SA Stein, B Baheri, D Chen, Y Mao, Q Guan, A Li, B Fang, S Xu 2021 IEEE International Conference on Quantum Computing and Engineering (QCE …, 2021	89*	2021
Bns-gcn: Efficient full-graph training of graph convolutional networks with partition-parallelism and random boundary node sampling C Wan, Y Li, A Li, NS Kim, Y Lin Proceedings of Machine Learning and Systems 4, 673-693, 2022	77	2022
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite A Li, SL Song, J Chen, X Liu, N Tallent, K Barker 2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018	68	2018
OpenCGRA: An open-source unified framework for modeling, testing, and evaluating CGRAs C Tan, C Xie, A Li, KJ Barker, A Tumeo 2020 IEEE 38th International Conference on Computer Design (ICCD), 381-388, 2020	64	2020
FPDeep: Scalable acceleration of CNN training on deeply-pipelined FPGA clusters T Wang, T Geng, A Li, X Jin, M Herbordt IEEE Transactions on Computers 69 (8), 1143-1158, 2020	63*	2020
Fine-grained synchronizations and dataflow programming on GPUs A Li, GJ van den Braak, H Corporaal, A Kumar Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015	61	2015
Fast synchronization‐free algorithms for parallel sparse triangular solves with multiple right‐hand sides W Liu, A Li, JD Hogg, IS Duff, B Vinter Concurrency and Computation: Practice and Experience 29 (21), e4244, 2017	59	2017
Exploring and analyzing the real impact of modern on-package memory on HPC scientific kernels A Li, W Liu, MRB Kristensen, B Vinter, H Wang, K Hou, A Marquez, ... Proceedings of the International Conference for High Performance Computing …, 2017	58	2017
Quclassi: A hybrid deep neural network architecture based on quantum state fidelity SA Stein, B Baheri, D Chen, Y Mao, Q Guan, A Li, S Xu, C Ding Proceedings of Machine Learning and Systems 4, 251-264, 2022	56	2022
Cudaadvisor: Llvm-based runtime profiling for modern gpus D Shen, SL Song, A Li, X Liu Proceedings of the 2018 International Symposium on Code Generation and …, 2018	55	2018
Gcod: Graph convolutional network acceleration via dedicated algorithm and accelerator co-design H You, T Geng, Y Zhang, A Li, Y Lin 2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022	54	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors