دنبال کردن
Zhuofan Zong
Zhuofan Zong
ایمیل تأیید شده در link.cuhk.edu.hk
عنوان
نقل شده توسط
نقل شده توسط
سال
Detrs with collaborative hybrid assignments training
Z Zong, G Song, Y Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
2882023
Raphael: Text-to-image generation via large mixture of diffusion paths
Z Xue, G Song, Q Guo, B Liu, Z Zong, Y Liu, P Luo
Advances in Neural Information Processing Systems 36, 2024
992024
Graph attention based proposal 3d convnets for action detection
J Li, X Liu, Z Zong, W Zhao, M Zhang, J Song
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4626-4633, 2020
502020
Temporal enhanced training of multi-view 3d object detector via historical object prediction
Z Zong, D Jiang, G Song, Z Xue, J Su, H Li, Y Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
332023
Self-slimmed vision transformer
Z Zong, K Li, G Song, Y Wang, Y Qiao, B Leng, Y Liu
European Conference on Computer Vision, 432-448, 2022
222022
RCNet: Reverse feature pyramid and cross-scale shift network for object detection
Z Zong, Q Cao, B Leng
Proceedings of the 29th ACM International Conference on Multimedia, 5637-5645, 2021
222021
Mova: Adapting mixture of vision experts to multimodal context
Z Zong, B Ma, D Shen, G Song, H Shao, D Jiang, H Li, Y Liu
arXiv preprint arXiv:2404.13046, 2024
192024
Visual cot: Unleashing chain-of-thought reasoning in multi-modal language models
H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li
arXiv preprint arXiv:2403.16999, 2024
172024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
D Jiang, G Song, X Wu, R Zhang, D Shen, Z Zong, Y Liu, H Li
arXiv preprint arXiv:2404.03653, 2024
72024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
B Ma, Z Zong, G Song, H Li, Y Liu
arXiv preprint arXiv:2406.11831, 2024
62024
Large-batch optimization for dense visual predictions
Z Xue, J Liang, G Song, Z Zong, L Chen, Y Liu, P Luo
Advances in Neural Information Processing Systems 1, 2022
52022
DETRs with collaborative hybrid assignments training (2023)
Z Zong, G Song, Y Liu
arXiv preprint arXiv:2211.12860, 0
5
Visual cot: Advancing multi-modal language models with a comprehensive dataset and benchmark for chain-of-thought reasoning
H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li
The Thirty-eight Conference on Neural Information Processing Systems …, 2024
32024
Large-batch optimization for dense visual predictions: Training faster R-CNN in 4.2 minutes
Z Xue, J Liang, G Song, Z Zong, L Chen, Y Liu, P Luo
Advances in Neural Information Processing Systems 35, 18694-18706, 2022
32022
سیستم در حال حاضر قادر به انجام عملکرد نیست. بعداً دوباره امتحان کنید.
مقاله‌ها 1–14