Yinan He

Citata da

	Tutte	Dal 2019
Citazioni	2399	2397
Indice H	16	16
i10-index	17	17

1900

950

475

1425

20222023202434 447 1802

Accesso pubblico

Visualizza tutto

4 articoli

0 articoli

Disponibili

Non disponibili

In base ai mandati di finanziamento

Coautori

Yu QiaoProfessor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CASEmail verificata su siat.ac.cn
Limin WangNanjing UniversityEmail verificata su nju.edu.cn
Yi WangShanghai AI LaboratoryEmail verificata su cse.cuhk.edu.hk
Yali WangProfessor, Shenzhen Institutes of Advanced Technology，Chinese Academy of SciencesEmail verificata su siat.ac.cn
Kunchang LiShenzhen Institutes of Advanced Technology, Chinese Academy of SciencesEmail verificata su siat.ac.cn
Yizhuo LiThe University of Hong KongEmail verificata su cs.hku.hk
Jiashuo YuShanghai AI LaboratoryEmail verificata su fudan.edu.cn
Ziwei LiuAssistant Professor, Nanyang Technological UniversityEmail verificata su ntu.edu.sg
Xinhao LiNanjing UniversityEmail verificata su smail.nju.edu.cn
Lu ShengSchool of Software, Beihang UniversityEmail verificata su buaa.edu.cn
Ziqi HuangPh.D. Student, MMLab@NTU, Nanyang Technological UniversityEmail verificata su e.ntu.edu.sg
Siyu ChenCarnegie Mellon UniversityEmail verificata su andrew.cmu.edu
Jing ShaoResearch Scientist, Shanghai AI Laboratory/Shanghai Jiao Tong University

Segui

Yinan He

Shanghai Al Laboratory

Email verificata su pjlab.org.cn


Titolo Ordina per citazioni Ordina per anno Ordina per titolo	Citata da Citata da	Anno
Videochat: Chat-centric video understanding KC Li, Y He, Y Wang, Y Li, W Wang, P Luo, Y Wang, L Wang, Y Qiao arXiv preprint arXiv:2305.06355, 2023	462	2023
Videomae v2: Scaling video masked autoencoders with dual masking L Wang, B Huang, Z Zhao, Z Tong, Y He, Y Wang, Y Wang, Y Qiao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	313	2023
Internvideo: General video foundation models via generative and discriminative learning Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ... arXiv preprint arXiv:2212.03191, 2022	281	2022
Lavie: High-quality video generation with cascaded latent diffusion models Y Wang, X Chen, X Ma, S Zhou, Z Huang, Y Wang, C Yang, Y He, J Yu, ... arXiv preprint arXiv:2309.15103, 2023	158	2023
Internvid: A large-scale video-text dataset for multimodal understanding and generation Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ... arXiv preprint arXiv:2307.06942, 2023	155	2023
Mvbench: A comprehensive multi-modal video understanding benchmark K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	150	2024
Forgerynet: A versatile benchmark for comprehensive forgery analysis Y He, B Gan, S Chen, Y Zhou, G Yin, L Song, L Sheng, J Shao, Z Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	146	2021
Unmasked teacher: Towards training-efficient video foundation models K Li, Y Wang, Y Li, Y Wang, Y He, L Wang, Y Qiao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	124	2023
Vbench: Comprehensive benchmark suite for video generative models Z Huang, Y He, J Yu, F Zhang, C Si, Y Jiang, Y Zhang, T Wu, Q Jin, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	115	2024
Uniformerv2: Spatiotemporal learning by arming image vits with video uniformer K Li, Y Wang, Y He, Y Li, Y Wang, L Wang, Y Qiao arXiv preprint arXiv:2211.09552, 2022	114	2022
Videomamba: State space model for efficient video understanding K Li, X Li, Y Wang, Y He, Y Wang, L Wang, Y Qiao European Conference on Computer Vision, 237-255, 2025	105	2025
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ... arXiv preprint arXiv:2305.05662, 2023	77	2023
Internvideo2: Scaling video foundation models for multimodal video understanding Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ... arXiv preprint arXiv:2403.15377, 2024	59	2024
Internvideo-ego4d: A pack of champion solutions to ego4d challenges G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ... arXiv preprint arXiv:2211.09529, 2022	40	2022
Uniformerv2: Unlocking the potential of image vits for video understanding K Li, Y Wang, Y He, Y Li, Y Wang, L Wang, Y Qiao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	38	2023
Intern: A new learning paradigm towards general vision J Shao, S Chen, Y Li, K Wang, Z Yin, Y He, J Teng, Q Sun, M Gao, J Liu, ... arXiv preprint arXiv:2111.08687, 2021	34	2021
From gpt-4 to gemini and beyond: Assessing the landscape of mllms on generalizability, trustworthiness and causality through four modalities C Lu, C Qian, G Zheng, H Fan, H Gao, J Zhang, J Shao, J Deng, J Fu, ... arXiv preprint arXiv:2401.15071, 2024	12	2024
X-learner: Learning cross sources and tasks for universal visual representation Y He, G Huang, S Chen, J Teng, K Wang, Z Yin, L Sheng, Z Liu, Y Qiao, ... European Conference on Computer Vision, 509-528, 2022	6	2022
OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Q Li, Z Chen, W Wang, W Wang, S Ye, Z Jin, G Chen, Y He, Z Gao, E Cui, ... arXiv preprint arXiv:2406.08418, 2024	5	2024
Harvest video foundation models via efficient post-pretraining Y Li, K Li, Y He, Y Wang, Y Wang, L Wang, Y Qiao, P Luo arXiv preprint arXiv:2310.19554, 2023	2	2023

Il sistema al momento non può eseguire l'operazione. Riprova più tardi.

Articoli 1–20

Citazioni per anno

Citazioni duplicate

Citazioni unite

Aggiungi coautoriCoautori

Segui

Citata da

Coautori