Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, Q Jiang, C Li, J Yang, H Su, ... European Conference on Computer Vision, 38-55, 2024 | 1767 | 2024 |
Revisiting scene text recognition: A data perspective Q Jiang, J Wang, D Peng, C Liu, L Jin Proceedings of the IEEE/CVF international conference on computer vision …, 2023 | 51 | 2023 |
Visual in-context prompting F Li, Q Jiang, H Zhang, T Ren, S Liu, X Zou, H Xu, H Li, J Yang, C Li, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 29 | 2024 |
T-Rex2: Towards generic object detection via text-visual prompt synergy Q Jiang, F Li, Z Zeng, T Ren, S Liu, L Zhang European Conference on Computer Vision, 38-57, 2025 | 24 | 2025 |
Grounding dino 1.5: Advance the" edge" of open-set object detection T Ren, Q Jiang, S Liu, Z Zeng, W Liu, H Gao, H Huang, Z Ma, X Jiang, ... arXiv preprint arXiv:2405.10300, 2024 | 22 | 2024 |
T-Rex: Counting by visual prompting Q Jiang, F Li, T Ren, S Liu, Z Zeng, K Yu, L Zhang arXiv preprint arXiv:2311.13596, 2023 | 10 | 2023 |
Dino-x: A unified vision model for open-world object detection and understanding T Ren, Y Chen, Q Jiang, Z Zeng, Y Xiong, W Liu, Z Ma, J Shen, Y Gao, ... arXiv preprint arXiv:2411.14347, 2024 | 6 | 2024 |
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding Q Jiang, G Luo, Y Yang, Y Xiong, Y Chen, Z Zeng, T Ren, L Zhang arXiv preprint arXiv:2411.18363, 2024 | 1 | 2024 |
QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text recognition using a Query-aware Transformer C Liu, Q Jiang, D Peng, Y Kong, J Zhang, L Xiong, J Duan, C Sun, L Jin Neurocomputing 620, 129241, 2025 | | 2025 |