Xinxin Zhu 朱欣鑫

Citata da

	Tutte	Dal 2019
Citazioni	1080	1076
Indice H	12	12
i10-index	13	13

380

190

285

20182019202020212022202320243 18 35 110 229 310 361

Accesso pubblico

Visualizza tutto

7 articoli

5 articoli

Disponibili

Non disponibili

In base ai mandati di finanziamento

Coautori

Jing Liu 刘静Professor in Institute of Automation of the Chinese Academy Sciences (CASIA)Email verificata su nlpr.ia.ac.cn
Longteng GuoAssociate Professor, Institute of Automation of the Chinese Academy Sciences (CASIA)Email verificata su nlpr.ia.ac.cn
Lixiang Li(李丽香)Professor in School of Cyberspace Security, Beijing University of Posts and TelecommunicationsEmail verificata su bupt.edu.cn
haipeng pengBeijing University of Posts and TelecommunicationsEmail verificata su bupt.edu.cn
Zhiwei FangBusiness Growth BU, JD.COMEmail verificata su jd.com

Segui

Xinxin Zhu 朱欣鑫

Institute of Automation of the Chinese Academy Sciences (CASIA)

Email verificata su nlpr.ia.ac.cn

multimodal Computer Vision


Titolo Ordina per citazioni Ordina per anno Ordina per titolo	Citata da Citata da	Anno
Normalized and geometry-aware self-attention network for image captioning L Guo, J Liu, X Zhu, P Yao, S Lu, H Lu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	249	2020
Cptr: Full transformer network for image captioning W Liu, S Chen, L Guo, X Zhu, J Liu arXiv preprint arXiv:2101.10804, 2021	204	2021
Captioning transformer with stacked attention modules X Zhu, L Li, J Liu, H Peng, X Niu Applied Sciences 8 (5), 739, 2018	113	2018
Vast: A vision-audio-subtitle-text omni-modality foundation model and dataset S Chen, H Li, Q Wang, Z Zhao, M Sun, X Zhu, J Liu Advances in Neural Information Processing Systems 36, 72842-72866, 2023	84	2023
Valor: Vision-audio-language omni-perception pretraining model and dataset S Chen, X He, L Guo, X Zhu, W Wang, J Tang, J Liu arXiv preprint arXiv:2304.08345, 2023	82	2023
Image captioning with triple-attention and stack parallel LSTM X Zhu, L Li, J Liu, Z Li, H Peng, X Niu Neurocomputing 319, 55-65, 2018	62	2018
Non-autoregressive image captioning with counterfactuals-critical multi-agent learning L Guo, J Liu, X Zhu, X He, J Jiang, H Lu arXiv preprint arXiv:2005.04690, 2020	56	2020
OPT: Omni-perception pre-trainer for cross-modal understanding and generation J Liu, X Zhu, F Liu, L Guo, Z Zhao, M Sun, W Wang, H Lu, S Zhou, J Zhang, ... arXiv preprint arXiv:2107.00249, 2021	47	2021
Chatbridge: Bridging modalities with large language model as a language catalyst Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu arXiv preprint arXiv:2305.16103, 2023	44	2023
Global-local propagation network for RGB-D semantic segmentation S Chen, X Zhu, W Liu, X He, J Liu arXiv preprint arXiv:2101.10801, 2021	24	2021
AutoCaption: Image captioning with neural architecture search X Zhu, W Wang, L Guo, J Liu arXiv preprint arXiv:2012.09742, 2020	18	2020
Global-guided selective context network for scene parsing J Jiang, J Liu, J Fu, X Zhu, Z Li, H Lu IEEE Transactions on Neural Networks and Learning Systems 33 (4), 1752-1764, 2020	14	2020
MOSO: Decomposing motion, scene and object for video prediction M Sun, W Wang, X Zhu, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	12	2023
Fast sequence generation with multi-agent reinforcement learning L Guo, J Liu, X Zhu, H Lu arXiv preprint arXiv:2101.09698, 2021	9	2021
Sounding video generator: A unified framework for text-guided sounding video generation J Liu, W Wang, S Chen, X Zhu, J Liu IEEE Transactions on Multimedia 26, 141-153, 2023	8	2023
Dual hierarchical temporal convolutional network with QA-aware dynamic normalization for video story question answering F Liu, J Liu, X Zhu, R Hong, H Lu Proceedings of the 28th ACM International Conference on Multimedia, 4253-4261, 2020	8	2020
Mm21 pre-training for video understanding challenge: Video captioning with pretraining techniques S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021	7	2021
Dynamic warping network for semantic video segmentation J Li, Y Zhao, X He, X Zhu, J Liu Complexity 2021 (1), 6680509, 2021	7	2021
Image captioning with word gate and adaptive self-critical learning X Zhu, L Li, J Liu, L Guo, Z Fang, H Peng, X Niu Applied Sciences 8 (6), 909, 2018	7	2018
Cptr: Full transformer network for image captioning (2021) W Liu, S Chen, L Guo, X Zhu, J Liu arXiv preprint arXiv:2101.10804, 0	6

Il sistema al momento non può eseguire l'operazione. Riprova più tardi.

Articoli 1–20

Citazioni per anno

Citazioni duplicate

Citazioni unite

Aggiungi coautoriCoautori

Segui

Citata da

Coautori