Jiasen Lu

Cited by

	All	Since 2019
Citations	19273	17289
h-index	26	26
i10-index	31	29

Citations per year
Year	Citations
2015	52
2016	201
2017	532
2018	1033
2019	1532
2020	2060
2021	2867
2022	3433
2023	3885
2024	3450

Public access

10 articles available
1 article not available

Based on funding mandates

Co-authors

Devi Parikh	Previously: FAIR and GenAI @ Meta. Georgia Tech	Verified email at gatech.edu
Dhruv Batra	Georgia Tech | Prev: FAIR (Meta AI)	Verified email at gatech.edu
Stefan Lee	Assistant Professor, Oregon State University	Verified email at oregonstate.edu
Jianwei Yang	Principal Researcher, Microsoft Research, Redmond	Verified email at microsoft.com
Caiming Xiong	Salesforce Research	Verified email at salesforce.com
Stanislaw Antol	Autonomous Vehicles Software Engineer, Mercedes-Benz R&D	Verified email at vt.edu
Richard Socher	you.com	Verified email at stanford.edu
Aniruddha Kembhavi	Senior Director of Computer Vision, Allen Institute of Artificial Intelligence	Verified email at allenai.org
Roozbeh Mottaghi	FAIR, Meta	Verified email at cs.stanford.edu
Rowan Zellers	OpenAI	Verified email at cs.washington.edu
Christopher Clark	Allen Institute for AI	Verified email at allenai.org
Marcus Rohrbach	Professor for Multimodal Reliable AI, TU Darmstadt, Germany	Verified email at tu-darmstadt.de
Vedanuj Goswami	Llama Team, Research Engineer, Meta AI	Verified email at meta.com
Adam Fisch	Google DeepMind	Verified email at google.com
Antoine Bordes	Helsing	Verified email at helsing.ai
Sheng Li	Quantitative Foundation Associate Professor of Data Science, University of Virginia	Verified email at virginia.edu
Chih-Yao Ma	Staff Research Scientist @ GenAI, Meta	Verified email at meta.com
Zuxuan Wu	Fudan University	Verified email at fudan.edu.cn
Peng Gao	Shanghai AI Lab	Verified email at pjlab.org.cn
Yejin Choi	University of Washington / Allen Institute for Artificial Intelligence	Verified email at cs.washington.edu

Jiasen Lu

Research Scientist, Apple

Verified email at apple.com

Computer Vision, Natural Language Processing


Title	Authors	Venue	Cited by	Year
Vqa: Visual question answering	A Agrawal, J Lu, S Antol*, M Mitchell, CL Zitnick, D Parikh, D Batra	International Journal of Computer Vision 123 (1), 4-31, 2017	6471*	2017
Vqa: Visual question answering	S Antol, A Agrawal, J Lu, M Mitchell, D Batra, C Lawrence Zitnick, ...	Proceedings of the IEEE International Conference on Computer Vision, 2425-2433, 2015	6463	2015
Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks	J Lu, D Batra, D Parikh, S Lee	Advances in neural information processing systems, 2019	3947	2019
Hierarchical question-image co-attention for visual question answering	J Lu, J Yang, D Batra, D Parikh	Advances in neural information processing systems 29, 2016	2029	2016
Knowing when to look: Adaptive attention via a visual sentinel for image captioning	J Lu, C Xiong, D Parikh, R Socher	Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017	1879	2017
Graph R-CNN for Scene Graph Generation	J Yang, J Lu, S Lee, D Batra, D Parikh	arXiv preprint arXiv:1808.00191, 2018	1000	2018
Neural Baby Talk	J Lu, J Yang, D Batra, D Parikh	In Proceedings of the IEEE conference on computer vision and pattern …, 2018	579	2018
12-in-1: Multi-Task Vision and Language Representation Learning	J Lu, V Goswami, M Rohrbach, D Parikh, S Lee	Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019	545	2019
Parlai: A dialog research software platform	AH Miller, W Feng, A Fisch, J Lu, D Batra, A Bordes, D Parikh, J Weston	arXiv preprint arXiv:1705.06476, 2017	454	2017
Unified-IO: A unified model for vision, language, and multi-modal tasks	J Lu, C Clark, R Zellers, R Mottaghi, A Kembhavi	arXiv preprint arXiv:2206.08916, 2022	375	2022
Self-monitoring navigation agent via auxiliary progress estimation	CY Ma, J Lu, Z Wu, G AlRegib, Z Kira, R Socher, C Xiong	arXiv preprint arXiv:1901.03035, 2019	300	2019
Merlot reserve: Neural script knowledge through vision and language and sound	R Zellers, J Lu, X Lu, Y Yu, Y Zhao, M Salehi, A Kusupati, J Hessel, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	251	2022
Sentinel gate for modulating auxiliary information in a long short-term memory (lstm) neural network	LU Jiasen, C Xiong, R Socher	US Patent 10,565,306, 2020	151	2020
Best of both worlds: Transferring knowledge from discriminative learning to a generative visual dialog model	J Lu, A Kannan, J Yang, D Parikh, D Batra	Advances in Neural Information Processing Systems 30, 2017	150	2017
Multi-modal answer validation for knowledge-based vqa	J Wu, J Lu, A Sabharwal, R Mottaghi	Proceedings of the AAAI conference on artificial intelligence 36 (3), 2712-2721, 2022	137	2022
X-lxmert: Paint, caption and answer questions with multi-modal transformers	J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi	arXiv preprint arXiv:2009.11278, 2020	116	2020
Adaptive attention model for image captioning	LU Jiasen, C Xiong, R Socher	US Patent 10,565,305, 2020	116	2020
A Faster Pytorch Implementation of Faster R-CNN	J Yang, J Lu, D Batra, D Parikh	https://github.com/jwyang/faster-rcnn.pytorch, 2018	108	2018
Spatially aware multimodal transformers for textvqa	Y Kant, D Batra, P Anderson, A Schwing, D Parikh, J Lu, H Agrawal	Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	99	2020
Deeper lstm and normalized cnn visual question answering model	J Lu, X Lin, D Batra, D Parikh	GitHub repository 6, 2015	82	2015
