Xizhou Zhu

نقل شده توسط

	همهٔ موارد	از 2019
نقل‌‏قول‌‏ها	16314	16104
شاخص h	34	34
شاخص i10	46	46

7000

3500

1750

5250

2018201920202021202220232024151 301 720 1597 2795 4235 6166

دسترسی عمومی

مشاهدهٔ همه

۱۱ مقاله

۰ مقاله

در دسترس

در دسترس نیست

براساس دستورات هزینه انتشار

دنبال کردن

Xizhou Zhu

Tsinghua University

ایمیل تأیید شده در tsinghua.edu.cn


عنوان به‌ترتیب نقل قول‌ها به‌ترتیب سال به‌ترتیب عنوان	نقل شده توسط نقل شده توسط	سال
Deformable detr: Deformable transformers for end-to-end object detection‏ X Zhu, W Su, L Lu, B Li, X Wang, J Dai‏ arXiv preprint arXiv:2010.04159, 2020‏	5575	2020
Deformable convnets v2: More deformable, better results‏ X Zhu, H Hu, S Lin, J Dai‏ Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019‏	2393	2019
Vl-bert: Pre-training of generic visual-linguistic representations‏ W Su, X Zhu, Y Cao, B Li, L Lu, F Wei, J Dai‏ arXiv preprint arXiv:1908.08530, 2019‏	1883	2019
Deep feature flow for video recognition‏ X Zhu, Y Xiong, J Dai, L Yuan, Y Wei‏ Proceedings of the IEEE conference on computer vision and pattern …, 2017‏	853	2017
Flow-guided feature aggregation for video object detection‏ X Zhu, Y Wang, J Dai, L Yuan, Y Wei‏ Proceedings of the IEEE international conference on computer vision, 408-417, 2017‏	819	2017
Internimage: Exploring large-scale vision foundation models with deformable convolutions‏ W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ...‏ Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023‏	676	2023
An empirical study of spatial attention mechanisms in deep networks‏ X Zhu, D Cheng, Z Zhang, S Lin, J Dai‏ Proceedings of the IEEE/CVF international conference on computer vision …, 2019‏	568	2019
Planning-oriented autonomous driving‏ Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ...‏ Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023‏	469	2023
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks‏ W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...‏ Advances in Neural Information Processing Systems 36, 2024‏	382	2024
Towards high performance video object detection‏ X Zhu, J Dai, L Yuan, Y Wei‏ Proceedings of the IEEE conference on computer vision and pattern …, 2018‏	329	2018
Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision‏ C Yang, Y Chen, H Tian, C Tao, X Zhu, Z Zhang, G Huang, H Li, Y Qiao, ...‏ Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023‏	235	2023
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites‏ Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...‏ arXiv preprint arXiv:2404.16821, 2024‏	195	2024
Uni-perceiver: Pre-training unified architecture for generic perception for zero-shot and few-shot tasks‏ X Zhu, J Zhu, H Li, X Wu, H Li, X Wang, J Dai‏ Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022‏	126	2022
Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe‏ H Li, C Sima, J Dai, W Wang, L Lu, H Wang, J Zeng, Z Li, J Yang, H Deng, ...‏ IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023‏	117	2023
Spatially adaptive inference with stochastic feature sampling and interpolation‏ Z Xie, Z Zhang, X Zhu, G Huang, S Lin‏ Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020‏	115	2020
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks‏ Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, M Zhong, Q Zhang, X Zhu, ...‏ Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024‏	104	2024
Siamese image modeling for self-supervised vision representation learning‏ C Tao, X Zhu, W Su, G Huang, B Li, J Zhou, Y Qiao, X Wang, J Dai‏ Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023‏	83	2023
An uncertainty-aware approach for exploratory microblog retrieval‏ M Liu, S Liu, X Zhu, Q Liao, F Wei, S Pan‏ IEEE transactions on visualization and computer graphics 22 (1), 250-259, 2015‏	82	2015
Ghost in the minecraft: Generally capable agents for open-world environments via large language models with text-based knowledge and memory‏ X Zhu, Y Chen, H Tian, C Tao, W Su, C Yang, G Huang, B Li, L Lu, ...‏ arXiv preprint arXiv:2305.17144, 2023‏	79	2023
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language‏ Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ...‏ arXiv preprint arXiv:2305.05662, 2023‏	77	2023

سیستم در حال حاضر قادر به انجام عملکرد نیست. بعداً دوباره امتحان کنید.

مقاله‌ها 1–20

نقل‌قول‌ها در سال

نقل‌قول تکراری

نقل‌قول‌های ادغام شده

افزودن نویسنده‌های همکارنویسندگان مشترک

دنبال کردن

نقل شده توسط