Follow
Rui Kong
Rui Kong
Verified email at sjtu.edu.cn - Homepage
Title
Cited by
Cited by
Year
Personal llm agents: Insights and survey about the capability, efficiency and security
Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu, W Xu, X Wang, Y Sun, ...
arXiv preprint arXiv:2401.05459, 2024
922024
Convrelu++: Reference-based lossless acceleration of conv-relu operations on mobile cpu
R Kong, Y Li, Y Yuan, L Kong
Proceedings of the 21st Annual International Conference on Mobile Systems …, 2023
82023
WiP: An On-device LLM-based Approach to Query Privacy Protection
Y Yuan, R Kong, Y Li, Y Liu
Proceedings of the Workshop on Edge and Mobile Foundation Models, 7-9, 2024
42024
Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping
R Kong, Y Li, Q Feng, W Wang, L Kong, Y Liu
arXiv preprint arXiv:2308.15030, 2023
32023
Patchbackdoor: Backdoor attack against deep neural networks without model modification
Y Yuan, R Kong, S Xie, Y Li, Y Liu
Proceedings of the 31st ACM International Conference on Multimedia, 9134-9142, 2023
22023
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design
R Kong, Q Li, X Fang, Q Feng, Q He, Y Dong, W Wang, Y Li, L Kong, Y Liu
arXiv preprint arXiv:2405.17741, 2024
12024
SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget
R Kong, Y Li, Q Feng, W Wang, X Ye, Y Ouyang, L Kong, Y Liu
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–7