Diffusionret: Generative text-video retrieval with diffusion model P Jin, H Li, Z Cheng, K Li, X Ji, C Liu, L Yuan, J Chen Proceedings of the IEEE/CVF international conference on computer vision …, 2023 | 63 | 2023 |
Locality guidance for improving vision transformers on tiny datasets K Li, R Yu, Z Wang, L Yuan, G Song, J Chen European Conference on Computer Vision, 110-127, 2022 | 58 | 2022 |
Out-of-candidate rectification for weakly supervised semantic segmentation Z Cheng, P Qiao, K Li, S Li, P Wei, X Ji, L Yuan, C Liu, J Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 53 | 2023 |
Acseg: Adaptive conceptualization for unsupervised semantic segmentation K Li, Z Wang, Z Cheng, R Yu, Y Zhao, G Song, C Liu, L Yuan, J Chen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 53* | 2023 |
Parallel vertex diffusion for unified visual grounding Z Cheng, K Li, P Jin, S Li, X Ji, L Yuan, C Liu, J Chen Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 1326-1334, 2024 | 25 | 2024 |
Lape: Layer-adaptive position embedding for vision transformers with independent layer normalization R Yu, Z Wang, Y Wang, K Li, C Liu, H Duan, X Ji, J Chen Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 11* | 2023 |
Freestyleret: retrieving images from style-diversified queries H Li, Y Jia, P Jin, Z Cheng, K Li, J Sui, C Liu, L Yuan European Conference on Computer Vision, 258-274, 2024 | 10 | 2024 |
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation K Li, Y Zhao, Z Wang, Z Cheng, P Jin, X Ji, L Yuan, C Liu, J Chen Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 8 | 2023 |
Speechcraft: A fine-grained expressive speech dataset with natural language description Z Jin, J Jia, Q Wang, K Li, S Zhou, S Zhou, X Qin, Z Wu Proceedings of the 32nd ACM International Conference on Multimedia, 1255-1264, 2024 | 7 | 2024 |
GraCo: Granularity-controllable interactive segmentation Y Zhao, K Li, Z Cheng, P Qiao, X Zheng, R Ji, C Liu, L Yuan, J Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 7 | 2024 |
Wico: Win-win cooperation of bottom-up and top-down referring image segmentation Z Cheng, P Jin, H Li, K Li, S Li, X Ji, C Liu, J Chen arXiv preprint arXiv:2306.10750, 2023 | 4 | 2023 |
Local action-guided motion diffusion model for text-to-motion generation P Jin, H Li, Z Cheng, K Li, R Yu, C Liu, X Ji, L Yuan, J Chen European Conference on Computer Vision, 392-409, 2024 | 2 | 2024 |
Instance brownian bridge as texts for open-vocabulary video instance segmentation Z Cheng, K Li, H Li, P Jin, C Liu, X Zheng, R Ji, J Chen arXiv preprint arXiv:2401.09732, 2024 | 2 | 2024 |
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding B Zhang, K Li, Z Cheng, Z Hu, Y Yuan, G Chen, S Leng, Y Jiang, H Zhang, ... arXiv preprint arXiv:2501.13106, 2025 | 1 | 2025 |
Difference in Euclidean Norm Can Cause Semantic Divergence in Batch Normalization. Z Wang, K Li, R Yu, Y Zhao, P Qiao, G Song, F Xu, J Chen, PC Laboratory CoRR, 2022 | 1* | 2022 |
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Z Cheng, H Zhang, K Li, S Leng, Z Hu, F Wu, D Zhao, X Li, L Bing arXiv preprint arXiv:2410.17243, 2024 | | 2024 |
Learning Pseudo 3D Guidance for View-Consistent Texturing with 2D Diffusion K Li, Y Fan, Y Wu, Z Sun, W Yang, X Ji, L Yuan, J Chen European Conference on Computer Vision, 18-34, 2024 | | 2024 |
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation Supplementary Material P Jin, H Li, Z Cheng, K Li, R Yu, C Liu, X Ji, L Yuan | | |
Supplementary Material for FreestyleRet: Retrieving Images from Style-Diversified Queries H Li, Y Jia, P Jin, Z Cheng, K Li, J Sui, C Liu, L Yuan | | |
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model Supplementary Material P Jin, H Li, Z Cheng, K Li, X Ji, CLL Yuan, J Chen | | |