Xin Li
Tencent Youtu Lab
Verified email at tencent.com
Title
Cited by
Year
Neural collaborative graph machines for table structure recognition
H Liu, X Li, B Liu, D Jiang, Y Liu, B Ren
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 41 | Year: 2022
The devil is in the frequency: Geminated gestalt autoencoder for self-supervised visual pre-training
H Liu, X Jiang, X Li, A Guo, Y Hu, D Jiang, B Ren
Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1649-1656, 2023
Cited by: 36 | Year: 2023
Show, read and reason: Table structure recognition with flexible context aggregator
H Liu, X Li, B Liu, D Jiang, Y Liu, B Ren, R Ji
Proceedings of the 29th ACM International Conference on Multimedia, 1084-1092, 2021
Cited by: 34 | Year: 2021
Nommer: Nominate synergistic context in vision transformer for visual recognition
H Liu, X Jiang, X Li, Z Bao, D Jiang, B Ren
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 18 | Year: 2022
Enhancing visual document understanding with contrastive learning in large visual-language models
X Li, Y Wu, X Jiang, Z Guo, M Gong, H Cao, Y Liu, D Jiang, X Sun
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 11 | Year: 2024
Query-driven generative network for document information extraction in the wild
H Cao, X Li, J Ma, D Jiang, A Guo, Y Hu, H Liu, Y Liu, B Ren
Proceedings of the 30th ACM International Conference on Multimedia, 4261-4271, 2022
Cited by: 11 | Year: 2022
Locate then generate: bridging vision and language with bounding box for scene-text VQA
Y Zhu, Z Liu, Y Liang, X Li, H Liu, C Bao, L Xu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 11479 …, 2023
Cited by: 9 | Year: 2023
Grab what you need: Rethinking complex table structure recognition with flexible components deliberation
H Liu, X Li, M Gong, B Liu, Y Wu, D Jiang, Y Liu, X Sun
Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3603-3611, 2024
Cited by: 8 | Year: 2024
Relational representation learning in visually-rich documents
X Li, Y Zheng, Y Hu, H Cao, Y Wu, D Jiang, Y Liu, B Ren
Proceedings of the 30th ACM International Conference on Multimedia, 4614-4624, 2022
Cited by: 7 | Year: 2022