Obserwuj
Zhaozhuo Xu
Tytuł
Cytowane przez
Cytowane przez
Rok
Scissorhands: Exploiting the persistence of importance hypothesis for llm kv cache compression at test time
Z Liu, A Desai, F Liao, W Wang, V Xie, Z Xu, A Kyrillidis, A Shrivastava
Advances in Neural Information Processing Systems 36, 2023
1762023
SAR-to-optical image translation using supervised cycle-consistent adversarial networks
L Wang, X Xu, Y Yu, R Yang, R Gui, Z Xu, F Pu
IEEE Access 7, 129136-129149, 2019
1422019
Deformable convnet with aspect ratio constrained nms for object detection in remote sensing imagery
Z Xu, X Xu, L Wang, R Yang, F Pu
Remote Sensing 9 (12), 1312, 2017
1352017
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
Z Liu, J Yuan, H Jin, S Zhong, Z Xu, V Braverman, B Chen, X Hu
arXiv preprint arXiv:2402.02750, 2024
1282024
Detection, tracking, and geolocation of moving vehicle from uav using monocular camera
X Zhao, F Pu, Z Wang, H Chen, Z Xu
IEEE Access 7, 101160-101170, 2019
872019
Mongoose: A learnable lsh framework for efficient neural network training
B Chen, Z Liu, B Peng, Z Xu, JL Li, T Dao, Z Song, A Shrivastava, C Re
International Conference on Learning Representations, 2020
822020
Möbius transformation for fast inner product search on graph
Z Zhou, S Tan, Z Xu, P Li
Advances in Neural Information Processing Systems 32, 2019
662019
LLM Multi-Agent Systems: Challenges and Open Problems
S Han, Q Zhang, Y Yao, W Jin, Z Xu, C He
arXiv preprint arXiv:2402.03578, 2024
442024
Fast item ranking under neural network based measures
S Tan, Z Zhou, Z Xu, P Li
Proceedings of the 13th International Conference on Web Search and Data …, 2020
402020
Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures
Z Xu, Z Song, A Shrivastava
Advances in Neural Information Processing Systems 34, 5576-5589, 2021
352021
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt
Z Xu, Z Liu, B Chen, Y Tang, J Wang, K Zhou, X Hu, A Shrivastava
arXiv preprint arXiv:2305.11186, 2023
342023
On efficient retrieval of top similarity vectors
S Tan, Z Zhou, Z Xu, P Li
Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019
332019
Fincon: A synthesized llm multi-agent system with conceptual verbal reinforcement for enhanced financial decision making
Y Yu, Z Yao, H Li, Z Deng, Y Jiang, Y Cao, Z Chen, J Suchow, Z Cui, R Liu, ...
Advances in Neural Information Processing Systems 37, 137010-137045, 2024
282024
Speeding up sparsification using inner product search data structures
Z Song, Z Xu, L Zhang
arXiv preprint arXiv:2204.03209, 2022
262022
Norm Adjusted Proximity Graph for Fast Inner Product Retrieval
S Tan, Z Xu, W Zhao, H Fei, Z Zhou, P Li
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021
262021
Raspberry pi based intelligent wireless sensor node for localized torrential rain monitoring
Z Xu, F Pu, X Fang, J Fu
Journal of Sensors 2016 (1), 4178079, 2016
232016
Locality Sensitive Teaching
Z Xu, B Chen, C Li, W Liu, L Song, Y Lin, A Shrivastava
Advances in Neural Information Processing Systems 34, 2021
212021
Sublinear Least-Squares Value Iteration via Locality Sensitive Hashing
A Shrivastava, Z Song, Z Xu
arXiv preprint arXiv:2105.08285, 2021
192021
Winner-take-all column row sampling for memory efficient adaptation of language model
Z Liu, G Wang, SH Zhong, Z Xu, D Zha, RR Tang, ZS Jiang, K Zhou, ...
Advances in Neural Information Processing Systems 36, 3402-3424, 2023
182023
Kv cache is 1 bit per channel: Efficient large language model inference with coupled quantization
T Zhang, J Yi, Z Xu, A Shrivastava
Advances in Neural Information Processing Systems 37, 3304-3331, 2024
162024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20