Haodong Duan 段浩东
Shanghai AI Laboratory
Verified email at pjlab.org.cn - Homepage
Title
Cited by
Year
MMBench: Is your multi-modal model an all-around player?
Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...
ECCV, 2024, 2023
Cited by: 849. Year: 2023
Revisiting skeleton-based action recognition
H Duan, Y Zhao, K Chen, D Lin, B Dai
CVPR, 2022
Cited by: 762. Year: 2022
InternLM2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
Cited by: 278. Year: 2024
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
Cited by: 243. Year: 2024
OpenCompass: A universal evaluation platform for foundation models
OC Contributors
OpenCompass Project, https://github.com/open-compass/opencompass, 2023
Cited by: 232. Year: 2023
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
MMA Contributors
OpenMMLab Project, https://github.com/open-mmlab/mmaction2, 2020
Cited by: 213*. Year: 2020
InternLM-XComposer: A vision-language large model for advanced text-image comprehension and composition
P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...
arXiv preprint arXiv:2309.15112, 2023
Cited by: 204. Year: 2023
InternLM: A multilingual language model with progressively enhanced capabilities
ILM Team
InternLM Project, https://github.com/InternLM/InternLM, 2023
Cited by: 203. Year: 2023
Are We on the Right Way for Evaluating Large Vision-Language Models?
L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ...
NeurIPS, 2024
Cited by: 188. Year: 2024
PYSKL: Towards Good Practices for Skeleton Action Recognition
H Duan, J Wang, K Chen, D Lin
ACMMM, 2022
Cited by: 158. Year: 2022
InternLM-XComposer2-4KHD: A pioneering large vision-language model handling resolutions from 336 pixels to 4K HD
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
NeurIPS, 2024
Cited by: 122. Year: 2024
Omni-sourced webly-supervised learning for video recognition
H Duan, Y Zhao, Y Xiong, W Liu, D Lin
ECCV, 2020
Cited by: 117. Year: 2020
ShareGPT4Video: Improving video understanding and generation with better captions
L Chen, X Wei, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, B Lin, ...
NeurIPS D&B, 2024
Cited by: 107. Year: 2024
InternLM-XComposer-2.5: A versatile large vision language model supporting long-contextual input and output
P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ...
arXiv preprint arXiv:2407.03320, 2024
Cited by: 88. Year: 2024
SRPGAN: perceptual generative adversarial network for single image super resolution
B Wu, H Duan, Z Liu, G Sun
arXiv preprint arXiv:1712.05927, 2017
Cited by: 78. Year: 2017
JourneyDB: A benchmark for generative image understanding
K Sun, J Pan, Y Ge, H Li, H Duan, X Wu, R Zhang, A Zhou, Z Qin, Y Wang, ...
NeurIPS D&B, 2023, 2024
Cited by: 77. Year: 2024
DG-STGCN: Dynamic spatial-temporal modeling for skeleton-based action recognition
H Duan, J Wang, K Chen, D Lin
arXiv preprint arXiv:2210.05895, 2022
Cited by: 66. Year: 2022
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
H Duan, J Yang, Y Qiao, X Fang, L Chen, Y Liu, X Dong, Y Zang, P Zhang, ...
ACMMM, 2024
Cited by: 65*. Year: 2024
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences
Y Zhou, H Duan, A Rao, B Su, J Wang
AAAI, 2023
Cited by: 41. Year: 2023
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
H Liu, Z Zheng, Y Qiao, H Duan, Z Fei, F Zhou, W Zhang, S Zhang, D Lin, ...
ACL Findings, 2024
Cited by: 37. Year: 2024