Swin transformer: Hierarchical vision transformer using shifted windows Z Liu, Y Lin, Y Cao, H Hu, Y Wei, Z Zhang, S Lin, B Guo Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 24084 | 2021 |
Swin transformer v2: Scaling up capacity and resolution Z Liu, H Hu, Y Lin, Z Yao, Z Xie, Y Wei, J Ning, Y Cao, Z Zhang, L Dong, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1869 | 2022 |
Simmim: A simple framework for masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, J Bao, Z Yao, Q Dai, H Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1354 | 2022 |
Propagate yourself: Exploring pixel-level consistency for unsupervised visual representation learning Z Xie, Y Lin, Z Zhang, Y Cao, S Lin, H Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 466 | 2021 |
Negative margin matters: Understanding margin in few-shot classification B Liu, Y Cao, Y Lin, Q Li, Z Zhang, M Long, H Hu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 386 | 2020 |
Swin transformer: Hierarchical vision transformer using shifted windows. arXiv 2021 Z Liu, Y Lin, Y Cao, H Hu, Y Wei, Z Zhang, S Lin, B Guo arXiv preprint arXiv:2103.14030 10, 0 | 243 | |
Proceedings of the IEEE/CVF international conference on computer vision Z Liu, Y Lin, Y Cao, H Hu, Y Wei, Z Zhang, S Lin, B Guo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 228 | 2021 |
Self-supervised learning with swin transformers Z Xie, Y Lin, Z Yao, Z Zhang, Q Dai, Y Cao, H Hu arXiv preprint arXiv:2105.04553, 2021 | 199 | 2021 |
A simple baseline for open-vocabulary semantic segmentation with pre-trained vision-language model M Xu, Z Zhang, F Wei, Y Lin, Y Cao, H Hu, X Bai European Conference on Computer Vision, 736-753, 2022 | 198 | 2022 |
A simple baseline for zero-shot semantic segmentation with pre-trained vision-language model M Xu, Z Zhang, F Wei, Y Lin, Y Cao, H Hu, X Bai arXiv preprint arXiv:2112.14757 3, 2, 2021 | 107 | 2021 |
Parametric instance classification for unsupervised visual feature learning Y Cao, Z Xie, B Liu, Y Lin, Z Zhang, H Hu Advances in neural information processing systems 33, 15614-15624, 2020 | 66 | 2020 |
On data scaling in masked image modeling Z Xie, Z Zhang, Y Cao, Y Lin, Y Wei, Q Dai, H Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 54 | 2023 |
Leveraging batch normalization for vision transformers Z Yao, Y Cao, Y Lin, Z Liu, Z Zhang, H Hu Proceedings of the IEEE/CVF International Conference on Computer Vision, 413-422, 2021 | 43 | 2021 |
V-detr: Detr with vertex relative position encoding for 3d object detection Y Shen, Z Geng, Y Yuan, Y Lin, Z Liu, C Wang, H Hu, N Zheng, B Guo arXiv preprint arXiv:2308.04409, 2023 | 24 | 2023 |
Detr does not need multi-scale or locality design Y Lin, Y Yuan, Z Zhang, C Li, N Zheng, H Hu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 11 | 2023 |
Could Giant Pre-trained Image Models Extract Universal Representations? Y Lin, Z Liu, Z Zhang, H Hu, N Zheng, S Lin, Y Cao Advances in Neural Information Processing Systems 35, 8332-8346, 2022 | 10 | 2022 |
Swin transformer: Hierarchical vision transformer using shifted windows. arXiv preprint arXiv: 210314030 Z Liu, Y Lin, Y Cao, H Hu, Y Wei, Z Zhang, S Lin, B Guo | 9 | 2021 |
Bootstrap your object detector via mixed training M Xu, Z Zhang, F Wei, Y Lin, Y Cao, S Lin, H Hu, X Bai Advances in Neural Information Processing Systems 34, 11315-11325, 2021 | 6 | 2021 |
A Simple Approach and Benchmark for 21,000-Category Object Detection Y Lin, C Li, Y Cao, Z Zhang, J Wang, L Wang, Z Liu, H Hu European Conference on Computer Vision, 1-18, 2022 | 1 | 2022 |
AugDETR: Improving Multi-scale Learning for Detection Transformer J Dong, Y Lin, C Li, S Zhou, N Zheng European Conference on Computer Vision, 238-255, 2025 | | 2025 |