Takip et
Shusheng Yang
Shusheng Yang
PhD student @ NYU Courant
nyu.edu üzerinde doğrulanmış e-posta adresine sahip
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Qwen technical report
J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ...
Tech Report, 2023
26822023
Qwen-vl: A frontier large vision-language model with versatile abilities
J Bai*, S Bai*, S Yang*, S Wang, S Tan, P Wang, J Lin, C Zhou, J Zhou
Tech Report, 2023
11692023
Instances as queries
Y Fang*, S Yang*, X Wang, Y Li, C Fang, Y Shan, B Feng, W Liu
ICCV 2021, 2021
3692021
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
S Tong, E Brown, P Wu, S Woo, M Middepogu, SC Akula, J Yang, S Yang, ...
NeurIPS 2024 Oral, 2024
2262024
Crossover learning for fast online video instance segmentation
S Yang*, Y Fang*, X Wang, Y Li, C Fang, Y Shan, B Feng, W Liu
ICCV 2021, 2021
1322021
Temporally Efficient Vision Transformer for Video Instance Segmentation
S Yang, X Wang, Y Li, Y Fang, J Fang, W Liu, X Zhao, Y Shan
CVPR 2022 Oral, 2022
832022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Y Fang*, S Yang*, S Wang*, Y Ge, Y Shan, X Wang
ICCV 2023, 2022
682022
ViTMatte: Boosting image matting with pre-trained plain vision transformers
J Yao, X Wang, S Yang, B Wang
Information Fusion 2024, 2024
622024
Masked Image Modeling with Denoising Contrast
K Yi, Y Ge, X Li, S Yang, D Li, J Wu, Y Shan, X Qie
ICLR 2023, 2022
532022
Touchstone: Evaluating vision-language models by language models
S Bai, S Yang, J Bai, P Wang, X Zhang, J Lin, X Wang, C Zhou, J Zhou
ArXiv 2023, 2023
462023
Tracking Instances as Queries
S Yang*, Y Fang*, X Wang, Y Li, Y Shan, B Feng, W Liu
CVPRW 2021, 2021
142021
Masked Visual Reconstruction in Language Semantic Space
S Yang, Y Ge, K Yi, D Li, Y Shan, X Qie, X Wang
CVPR 2023, 2023
10*2023
MobileInst: Video Instance Segmentation on the Mobile
R Zhang*, T Cheng*, S Yang, H Jiang, S Zhang, J Lyu, X Li, X Ying, ...
AAAI 2024, 2024
82024
Relational Surrogate Loss Learning
T Huang, Z Li, H Lu, Y Shan, S Yang, Y Feng, F Wang, S You, C Xu
ICLR 2022, 2022
82022
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
J yang*, S Yang*, A Gupta*, R Han*, L Fei-Fei, S Xie
CVPR 2025, 2024
52024
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–15