Haodong Duan 段浩东

Cited by

	All	Since 2020
Citations	4419	4386
h-index	24	24
i10-index	31	31

Citations per year
Year	Citations
2019	23
2020	25
2021	65
2022	208
2023	632
2024	2798
2025	651

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Dahua Lin	The Chinese University of Hong Kong	Verified email at ie.cuhk.edu.hk
Kai Chen	Shanghai AI Laboratory	Verified email at pjlab.org.cn
Jiaqi Wang	Shanghai AI Laboratory	Verified email at pjlab.org.cn
Xiaoyi Dong	Shanghai AI Laboratory	Verified email at mail.ustc.edu.cn
Yuhang Zang	Shanghai AI Laboratory	Verified email at pjlab.org.cn
Songyang Zhang	Shanghai AI Laboratory	Verified email at pjlab.org.cn
Yue Zhao	University of Texas at Austin	Verified email at cs.utexas.edu
Wenwei Zhang	Shanghai AI Laboratory	Verified email at ntu.edu.sg
Lin Chen	University of Science and Technology of China	Verified email at mail.ustc.edu.cn
Yining Li	Shanghai AI Laboratory	Verified email at pjlab.org.cn
Maosong Cao	Shanghai AI Lab	Verified email at shanghaitech.edu.cn
Yuanhan Zhang	PhD Candidate, MMLab@NTU	Verified email at e.ntu.edu.sg
Yuan LIU 柳源	WeChat AI	Verified email at tencent.com
Xinyu Fang	Shanghai Artificial Intelligence Laboratory	Verified email at pjlab.org.cn
Bo Dai	The University of Hong Kong	Verified email at hku.hk
Jinsong Li	MMLab, The Chinese University of Hong Kong	Verified email at ie.cuhk.edu.hk
Yuanjun Xiong	Adobe Firefly	Verified email at adobe.com
Wentao Liu	SenseTime Group Limited	Verified email at sensetime.com
Xiangyu Zhao	Shanghai Jiaotong University	Verified email at sjtu.edu.cn
Limin Wang	Nanjing University	Verified email at nju.edu.cn


Haodong Duan 段浩东

Shanghai AI Laboratory

Verified email at pjlab.org.cn

Computer Vision, Video Understanding, Multimodal Learning


Title	Authors	Venue	Cited by	Year
MMBench: Is your multi-modal model an all-around player?	Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...	ECCV, 2024	849	2023
Revisiting skeleton-based action recognition	H Duan, Y Zhao, K Chen, D Lin, B Dai	CVPR, 2022	762	2022
InternLM2 technical report	Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...	arXiv preprint arXiv:2403.17297	278	2024
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model	X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...	arXiv preprint arXiv:2401.16420	243	2024
OpenCompass: A universal evaluation platform for foundation models	OC Contributors	OpenCompass Project, https://github.com/open-compass/opencompass	232	2023
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark	MMA Contributors	OpenMMLab Project, https://github.com/open-mmlab/mmaction2	213*	2020
InternLM-XComposer: A vision-language large model for advanced text-image comprehension and composition	P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...	arXiv preprint arXiv:2309.15112	204	2023
InternLM: A multilingual language model with progressively enhanced capabilities	ILM Team	InternLM Project, https://github.com/InternLM/InternLM	203	2023
Are We on the Right Way for Evaluating Large Vision-Language Models?	L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ...	NeurIPS, 2024	188	2024
PYSKL: Towards Good Practices for Skeleton Action Recognition	H Duan, J Wang, K Chen, D Lin	ACMMM, 2022	158	2022
InternLM-XComposer2-4KHD: A pioneering large vision-language model handling resolutions from 336 pixels to 4K HD	X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...	NeurIPS, 2024	122	2024
Omni-sourced webly-supervised learning for video recognition	H Duan, Y Zhao, Y Xiong, W Liu, D Lin	ECCV, 2020	117	2020
ShareGPT4Video: Improving video understanding and generation with better captions	L Chen, X Wei, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, B Lin, ...	NeurIPS D&B, 2024	107	2024
InternLM-XComposer-2.5: A versatile large vision language model supporting long-contextual input and output	P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ...	arXiv preprint arXiv:2407.03320	88	2024
SRPGAN: perceptual generative adversarial network for single image super resolution	B Wu, H Duan, Z Liu, G Sun	arXiv preprint arXiv:1712.05927	78	2017
JourneyDB: A benchmark for generative image understanding	K Sun, J Pan, Y Ge, H Li, H Duan, X Wu, R Zhang, A Zhou, Z Qin, Y Wang, ...	NeurIPS D&B, 2023	77	2024
DG-STGCN: Dynamic spatial-temporal modeling for skeleton-based action recognition	H Duan, J Wang, K Chen, D Lin	arXiv preprint arXiv:2210.05895	66	2022
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models	H Duan, J Yang, Y Qiao, X Fang, L Chen, Y Liu, X Dong, Y Zang, P Zhang, ...	ACMMM, 2024	65*	2024
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences	Y Zhou, H Duan, A Rao, B Su, J Wang	AAAI, 2023	41	2023
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark	H Liu, Z Zheng, Y Qiao, H Duan, Z Fei, F Zhou, W Zhang, S Zhang, D Lin, ...	ACL Findings, 2024	37	2024
