Le Xue
Senior Applied Scientist, Salesforce Research
Verified email at salesforce.com
Title
Cited by
Year
ULIP: Learning a unified representation of language, images, and point clouds for 3D understanding
L Xue, M Gao, C Xing, R Martín-Martín, J Wu, C Xiong, R Xu, JC Niebles, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 257 | Year: 2023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
L Xue, N Yu, S Zhang, J Li, R Martín-Martín, J Wu, C Xiong, R Xu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 113 | Year: 2023
BOLAA: Benchmarking and orchestrating LLM-augmented autonomous agents
Z Liu, W Yao, J Zhang, L Xue, S Heinecke, R Murthy, Y Feng, Z Chen, ...
arXiv preprint arXiv:2308.05960, 2023
Cited by: 82 | Year: 2023
Retroformer: Retrospective large language agents with policy gradient optimization
W Yao, S Heinecke, JC Niebles, Z Liu, Y Feng, L Xue, R Murthy, Z Chen, ...
arXiv preprint arXiv:2308.02151, 2023
Cited by: 64 | Year: 2023
xGen-MM (BLIP-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
Cited by: 60 | Year: 2024
X-InstructBLIP: A framework for aligning X-modal instruction-aware representations to LLMs and emergent cross-modal reasoning
A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty, R Xu, S Savarese, ...
arXiv preprint arXiv:2311.18799, 2023
Cited by: 47 | Year: 2023
MINT-1T: Scaling open-source multimodal data by 10x: A multimodal dataset with one trillion tokens
A Awadalla, L Xue, O Lo, M Shu, H Lee, E Guha, S Shen, M Awadalla, ...
Advances in Neural Information Processing Systems 37, 36805-36828, 2024
Cited by: 22 | Year: 2024
Directed weighted network structure analysis of complex impedance measurements for characterizing oil-in-water bubbly flow
ZK Gao, WD Dang, L Xue, SS Zhang
Chaos: An Interdisciplinary Journal of Nonlinear Science 27 (3), 2017
Cited by: 15 | Year: 2017
REX: Rapid exploration and exploitation for AI agents
R Murthy, S Heinecke, JC Niebles, Z Liu, L Xue, W Yao, Y Feng, Z Chen, ...
arXiv preprint arXiv:2307.08962, 2023
Cited by: 8 | Year: 2023
xGen-MM-Vid (BLIP-3-Video): You only need 32 tokens to represent a video even in VLMs
MS Ryoo, H Zhou, S Kendre, C Qin, L Xue, M Shu, S Savarese, R Xu, ...
arXiv preprint arXiv:2410.16267, 2024
Cited by: 6 | Year: 2024
Robustness evaluation of transformer-based form field extractors via form attacks
L Xue, M Gao, Z Chen, C Xiong, R Xu
International Conference on Document Analysis and Recognition, 167-184, 2023
Cited by: 6 | Year: 2023
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
J Zhang, L Xue, L Song, J Wang, W Huang, M Shu, A Yan, Z Ma, ...
arXiv preprint arXiv:2412.07012, 2024
Cited by: 5 | Year: 2024
DocQueryNet: Value retrieval with arbitrary queries for form-like documents
M Gao, L Xue, C Ramaiah, C Xing, R Xu, C Xiong
Proceedings of the 29th International Conference on Computational …, 2022
Cited by: 5* | Year: 2022
LLAVIDAL: Benchmarking large language vision models for daily activities of living
R Chakraborty, A Sinha, D Reilly, MK Govind, P Wang, F Bremond, S Das
arXiv preprint arXiv:2406.09390, 2024
Cited by: 3 | Year: 2024
Image analysis based document processing for inference of key-value pairs in non-fixed digital documents
M Gao, C Zeyuan, L Xue, R Xu, C Xiong
US Patent 11,699,297, 2023
Cited by: 3 | Year: 2023
Model-agnostic hierarchical attention for 3D object detection
M Shu, L Xue, N Yu, R Martín-Martín, JC Niebles, C Xiong, R Xu
arXiv preprint arXiv:2301.02650, 2023
Cited by: 3 | Year: 2023
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
C Qin, C Xia, K Ramakrishnan, M Ryoo, L Tu, Y Feng, M Shu, H Zhou, ...
arXiv preprint arXiv:2408.12590, 2024
Cited by: 2 | Year: 2024
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
A Awadalla, L Xue, M Shu, A Yan, J Wang, S Purushwalkam, S Shen, ...
arXiv preprint arXiv:2411.07461, 2024
Cited by: 1 | Year: 2024
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning
A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty, R Xu, S Savarese, ...
European Conference on Computer Vision, 177-197, 2024
Cited by: 1 | Year: 2024
Systems and methods for orchestrating LLM-augmented autonomous agents
Z Liu, W Yao, J Zhang, L Xue, S Heinecke, R Murthy, Y Feng, Z Chen, ...
US Patent App. 18/494,393, 2025
Year: 2025
Articles 1–20