End-to-end object detection with fully convolutional network J Wang, L Song, Z Li, H Sun, J Sun, N Zheng Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 243 | 2021 |
Learning dynamic routing for semantic segmentation Y Li, L Song, Y Chen, Z Li, X Zhang, X Wang, J Sun Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 199 | 2020 |
GPT4Tools: Teaching large language model to use tools via self-instruction R Yang, L Song, Y Li, S Zhao, Y Ge, X Li, Y Shan Advances in Neural Information Processing Systems 36, 2024 | 150 | 2024 |
YOLO-World: Real-time open-vocabulary object detection T Cheng, L Song, Y Ge, W Liu, X Wang, Y Shan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 145 | 2024 |
TACNet: Transition-aware context network for spatio-temporal action detection L Song, S Zhang, G Yu, H Sun Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 99 | 2019 |
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition X Ding, Y Zhang, Y Ge, S Zhao, L Song, X Yue, Y Shan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 97 | 2024 |
Learnable tree filter for structure-preserving feature transform L Song, Y Li, Z Li, G Yu, H Sun, J Sun, N Zheng Advances in Neural Information Processing Systems, 1711-1721, 2019 | 53 | 2019 |
Fine-grained dynamic head for object detection L Song, Y Li, Z Jiang, Z Li, H Sun, J Sun, N Zheng Advances in Neural Information Processing Systems 33, 11131-11141, 2020 | 45 | 2020 |
GLNet: Global local network for weakly supervised action localization S Zhang, L Song, C Gao, N Sang IEEE Transactions on Multimedia 22 (10), 2610-2622, 2019 | 38 | 2019 |
Dynamic grained encoder for vision transformers L Song, S Zhang, S Liu, Z Li, X He, H Sun, J Sun, N Zheng Advances in Neural Information Processing Systems 34, 5770-5783, 2021 | 34 | 2021 |
SEED-X: Multimodal models with unified multi-granularity comprehension and generation Y Ge, S Zhao, J Zhu, Y Ge, K Yi, L Song, C Li, X Ding, Y Shan arXiv preprint arXiv:2404.14396, 2024 | 33 | 2024 |
DBQ-SSD: Dynamic ball query for efficient 3D object detection J Yang, L Song, S Liu, W Mao, Z Li, X Li, H Sun, J Sun, N Zheng arXiv preprint arXiv:2207.10909, 2022 | 27 | 2022 |
NIPM-sWMF: Toward efficient FPGA design for high-definition large-disparity stereo matching X Zhang, H Sun, S Chen, L Song, N Zheng IEEE Transactions on Circuits and Systems for Video Technology 29 (5), 1530-1543, 2018 | 25 | 2018 |
BoxSnake: Polygonal instance segmentation with box supervision R Yang, L Song, Y Ge, X Li Proceedings of the IEEE/CVF International Conference on Computer Vision, 766-776, 2023 | 24 | 2023 |
Human centric spatio-temporal action localization J Jiang, Y Cao, L Song, S Zhang, Y Li, Z Xu, Q Wu, C Gan, C Zhang, G Yu ActivityNet Workshop on CVPR, 2018 | 24 | 2018 |
Rethinking learnable tree filter for generic feature transform L Song, Y Li, Z Jiang, Z Li, X Zhang, H Sun, J Sun, N Zheng Advances in Neural Information Processing Systems, 2020 | 20 | 2020 |
Meta-adapter: An online few-shot learner for vision-language model L Song, R Xue, H Wang, H Sun, Y Ge, Y Shan Advances in Neural Information Processing Systems 36, 55361-55374, 2023 | 13 | 2023 |
InstructDET: Diversifying referring object detection with generalized instructions R Dang, J Feng, H Zhang, C Ge, L Song, L Gong, C Liu, Q Chen, F Zhu, ... arXiv preprint arXiv:2310.05136, 2023 | 10 | 2023 |
Workshop on autonomous driving at CVPR 2021: Technical report for streaming perception challenge S Zhang, L Song, S Liu, Z Ge, Z Li, X He, J Sun arXiv preprint arXiv:2108.04230, 2021 | 8 | 2021 |
Low-Rank Approximation for Sparse Attention in Multi-Modal LLMs L Song, Y Chen, S Yang, X Ding, Y Ge, YC Chen, Y Shan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 4 | 2024 |