Obserwuj
Siyang Qin
Siyang Qin
Software Engineer at Google DeepMind
Zweryfikowany adres z google.com - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Towards unconstrained end-to-end text spotting
S Qin, A Bissacco, M Raptis, Y Fujii, Y Xiao
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
1492019
Towards end-to-end unified scene text detection and layout analysis
S Long, S Qin, D Panteleev, A Bissacco, Y Fujii, M Raptis
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1042022
Rethinking text line recognition models
DH Diaz, S Qin, R Ingle, Y Fujii, A Bissacco
arXiv preprint arXiv:2104.07787, 2021
572021
Cascaded segmentation-detection networks for word-level text spotting
S Qin, R Manduchi
2017 14th IAPR international conference on document analysis and recognition …, 2017
572017
A fast and robust text spotter
S Qin, R Manduchi
2016 IEEE Winter Conference on Applications of Computer Vision (WACV), 1-8, 2016
382016
Rope: reading order equivariant positional encoding for graph-based document information extraction
CY Lee, CL Li, C Wang, R Wang, Y Fujii, S Qin, A Popat, T Pfister
arXiv preprint arXiv:2106.10786, 2021
292021
Automatic skin and hair masking using fully convolutional networks
S Qin, S Kim, R Manduchi
2017 IEEE International Conference on Multimedia and Expo (ICME), 103-108, 2017
272017
Scene text access: A comparison of mobile OCR modalities for blind users
L Neat, R Peng, S Qin, R Manduchi
Proceedings of the 24th International Conference on Intelligent User …, 2019
212019
Fluid: Scaling autoregressive text-to-image generative models with continuous tokens
L Fan, T Li, S Qin, Y Li, C Sun, M Rubinstein, D Sun, K He, Y Tian
arXiv preprint arXiv:2410.13863, 2024
202024
ICDAR 2023 competition on hierarchical text detection and recognition
S Long, S Qin, D Panteleev, A Bissacco, Y Fujii, M Raptis
International Conference on Document Analysis and Recognition, 483-497, 2023
162023
Dynamic mapping for multiview autostereoscopic displays
J Liu, T Malzbender, S Qin, B Zhang, CA Wu, J Davis
Stereoscopic Displays and Applications XXVI 9391, 400-407, 2015
132015
Formnetv2: Multimodal graph contrastive learning for form document information extraction
CY Lee, CL Li, H Zhang, T Dozat, V Perot, G Su, X Zhang, K Sohn, ...
arXiv preprint arXiv:2305.02549, 2023
122023
Robust and accurate text stroke segmentation
S Qin, P Ren, S Kim, R Manduchi
2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 242-250, 2018
122018
Automatic semantic content removal by learning to neglect
S Qin, J Wei, R Manduchi
arXiv preprint arXiv:1807.07696, 2018
102018
Multi-planar monocular reconstruction of manhattan indoor scenes
S Kim, R Manduchi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
92019
Paligemma 2: A family of versatile vlms for transfer
A Steiner, AS Pinto, M Tschannen, D Keysers, X Wang, Y Bitton, ...
arXiv preprint arXiv:2412.03555, 2024
72024
Hierarchical text spotter for joint text spotting and layout analysis
S Long, S Qin, Y Fujii, A Bissacco, M Raptis
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
62024
Towards end-to-end unified scene text detection and layout analysis. 2022 IEEE
S Long, S Qin, D Panteleev, A Bissacco, Y Fujii, M Raptis
CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1039-1049, 2022
52022
Text Spotting in the Wild
S Qin
University of California, Santa Cruz, 2018
12018
Joint text spotting and layout analysis
S Long, S Qin, Y Fujii, A Bissacco, M Raptis
US Patent App. 18/772,414, 2025
2025
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20