Zhuofan Zong - Google Scholar

Cited by

	All	Since 2019
Citations	579	577
h-index	8	8
i10-index	8	8

Citations per year: 2021: 13; 2022: 17; 2023: 107; 2024: 408

Public access

5 articles available, 0 articles not available (based on funding mandates)

Co-authors

Guanglu Song; Senior Researcher @Sensetime Base Model; verified email at sensetime.com
Hongsheng Li (李鸿升); The Chinese University of Hong Kong; verified email at ee.cuhk.edu.hk
Dongzhi Jiang; MMLab, CUHK; verified email at link.cuhk.edu.hk
Hao Shao; CUHK, MMLab; verified email at link.cuhk.edu.hk
Kunchang Li; Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences; verified email at siat.ac.cn

Zhuofan Zong

MMLab, The Chinese University of Hong Kong

Verified email at link.cuhk.edu.hk

Large Models, Multimodal, Object Detection, 3D Object Detection


Title	Cited by	Year
Detrs with collaborative hybrid assignments training. Z Zong, G Song, Y Liu. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	288	2023
Raphael: Text-to-image generation via large mixture of diffusion paths. Z Xue, G Song, Q Guo, B Liu, Z Zong, Y Liu, P Luo. Advances in Neural Information Processing Systems 36, 2024	99	2024
Graph attention based proposal 3d convnets for action detection. J Li, X Liu, Z Zong, W Zhao, M Zhang, J Song. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4626-4633, 2020	50	2020
Temporal enhanced training of multi-view 3d object detector via historical object prediction. Z Zong, D Jiang, G Song, Z Xue, J Su, H Li, Y Liu. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	33	2023
Self-slimmed vision transformer. Z Zong, K Li, G Song, Y Wang, Y Qiao, B Leng, Y Liu. European Conference on Computer Vision, 432-448, 2022	22	2022
RCNet: Reverse feature pyramid and cross-scale shift network for object detection. Z Zong, Q Cao, B Leng. Proceedings of the 29th ACM International Conference on Multimedia, 5637-5645, 2021	22	2021
Mova: Adapting mixture of vision experts to multimodal context. Z Zong, B Ma, D Shen, G Song, H Shao, D Jiang, H Li, Y Liu. arXiv preprint arXiv:2404.13046, 2024	19	2024
Visual cot: Unleashing chain-of-thought reasoning in multi-modal language models. H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li. arXiv preprint arXiv:2403.16999, 2024	17	2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching. D Jiang, G Song, X Wu, R Zhang, D Shen, Z Zong, Y Liu, H Li. arXiv preprint arXiv:2404.03653, 2024	7	2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models. B Ma, Z Zong, G Song, H Li, Y Liu. arXiv preprint arXiv:2406.11831, 2024	6	2024
Large-batch optimization for dense visual predictions. Z Xue, J Liang, G Song, Z Zong, L Chen, Y Liu, P Luo. Advances in Neural Information Processing Systems 1, 2022	5	2022
DETRs with collaborative hybrid assignments training (2023). Z Zong, G Song, Y Liu. arXiv preprint arXiv:2211.12860	5
Visual cot: Advancing multi-modal language models with a comprehensive dataset and benchmark for chain-of-thought reasoning. H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li. The Thirty-eight Conference on Neural Information Processing Systems …, 2024	3	2024
Large-batch optimization for dense visual predictions: Training faster R-CNN in 4.2 minutes. Z Xue, J Liang, G Song, Z Zong, L Chen, Y Liu, P Luo. Advances in Neural Information Processing Systems 35, 18694-18706, 2022	3	2022

Articles 1–14