Block convolution: toward memory-efficient inference of large-scale CNNs on FPGA G Li, Z Liu, F Li, J Cheng
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2021
72 2021 Hardware acceleration of fully quantized bert for efficient natural language processing Z Liu, G Li, J Cheng
2021 Design, Automation & Test in Europe Conference & Exhibition (DATE), 513-516, 2021
61 2021 EBERT: Efficient BERT inference with dynamic structured pruning Z Liu, F Li, G Li, J Cheng
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
47 2021 : Aggregation-Aware Quantization for Graph Neural Networks Z Zhu, F Li, Z Mo, Q Hu, G Li, Z Liu, X Liang, J Cheng
arXiv preprint arXiv:2302.00193, 2023
14 2023 A system-level solution for low-power object detection F Li, Z Mo, P Wang, Z Liu, J Zhang, G Li, Q Hu, X He, C Leng, Y Zhang, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
10 2019 MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization Z Zhu, F Li, G Li, Z Liu, Z Mo, Q Hu, X Liang, J Cheng
2024 IEEE International Symposium on High-Performance Computer Architecture …, 2024
8 2024 Hardware acceleration of CNN with one-hot quantization of weights and activations G Li, P Wang, Z Liu, C Leng, J Cheng
2020 Design, Automation & Test in Europe Conference & Exhibition (DATE), 971-974, 2020
7 2020 Efficient accelerator/network co-search with circular greedy reinforcement learning Z Liu, G Li, J Cheng
IEEE Transactions on Circuits and Systems II: Express Briefs 70 (7), 2615-2619, 2023
6 2023 TBERT: Dynamic BERT Inference with Top-k Based Predictors Z Liu, K Zhao, J Cheng
2023 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-6, 2023
2 2023