Seguir
Songyang Zhang
Songyang Zhang
Otros nombres张 松阳
Shanghai AI Laboratory
Dirección de correo verificada de pjlab.org.cn - Página principal
Título
Citado por
Citado por
Año
Mmbench: Is your multi-modal model an all-around player?
Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...
European Conference on Computer Vision, 216-233, 2025
5572025
Part-aware prototype network for few-shot semantic segmentation
Y Liu, X Zhang, S Zhang, X He
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
3652020
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
S Zhang, Z Li, S Yan, X He, J Sun
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
3202021
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation
R Li, S Zhang, B Wan, X He
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2362021
Internlm: A multilingual language model with progressively enhanced capabilities
ILM Team
2023-01-06)[2023-09-27]. https://github. com/InternLM/InternLM, 2023
1772023
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
1642024
OpenCompass: A universal evaluation platform for foundation models.
OC Contributors
https://github.com/open-compass/opencompass, 2023
1622023
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition
P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...
arXiv preprint arXiv:2309.15112, 2023
1522023
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
1472024
SGTR: End-to-end Scene Graph Generation with Transformer
R Li, S Zhang, X He
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2022
1062022
Dynamic context correspondence network for semantic alignment
S Huang, Q Wang, S Zhang, S Yan, X He
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
972019
LatentGNN: Learning Efficient Non-local Relations for Visual Recognition
S Zhang, S Yan, X He
Proceedings of the 36th International Conference on Machine Learning, 2019
912019
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
arXiv preprint arXiv:2404.06512, 2024
752024
A Dual Attention Network with Semantic Embedding for Few-Shot Learning.
S Yan, S Zhang, X He
AAAI 33, 9079-9086, 2019
722019
Openmmlab’s image classification toolbox and benchmark
M Contributors
URL: https://github. com/open-mmlab/mmclassification 5, 2020
712020
Lawbench: Benchmarking legal knowledge of large language models
Z Fei, X Shen, D Zhu, F Zhou, Z Han, S Zhang, K Chen, Z Shen, J Ge
arXiv preprint arXiv:2309.16289, 2023
522023
Action Quality Assessment with Temporal Parsing Transformer
Y Bai, D Zhou, S Zhang, J Wang, E Ding, Y Guan, Y Long, J Wang
European Conference on Computer Vision, 2022
492022
An em framework for online incremental learning of semantic segmentation
S Yan, J Zhou, J Xie, S Zhang, X He
Proceedings of the 29th ACM international conference on multimedia, 3052-3060, 2021
482021
Learning Implicit Temporal Alignment for Few-shot Video Classification
S Zhang, J Zhou, X He
International Joint Conferences on Artificial Intelligence, 2021
432021
Predicting Salient Face in Multiple-face Videos
Y Liu, S Zhang, M Xu, X He
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
432017
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20