Jiasen Lu

Cited by

	All	Since 2019
Citations	17878	15947
h-index	25	25
i10-index	30	28

Year	Citations
2015	52
2016	200
2017	532
2018	1029
2019	1533
2020	2058
2021	2854
2022	3426
2023	3855
2024	2214

Public access (based on funding mandates)

Available	10 articles
Not available	0 articles

Co-authors

Devi Parikh	Previously: FAIR and GenAI @ Meta. Georgia Tech	Verified email at gatech.edu
Dhruv Batra	Georgia Tech | Prev: FAIR (Meta AI)	Verified email at gatech.edu
Stefan Lee	Assistant Professor, Oregon State University	Verified email at oregonstate.edu
Jianwei Yang	Principal Researcher, Microsoft Research, Redmond	Verified email at microsoft.com
Caiming Xiong	Salesforce Research	Verified email at salesforce.com
Stanislaw Antol	Autonomous Vehicles Software Engineer, Mercedes-Benz R&D	Verified email at vt.edu
Richard Socher	you.com	Verified email at stanford.edu
Aniruddha Kembhavi	Senior Director of Computer Vision, Allen Institute of Artificial Intelligence	Verified email at allenai.org
Roozbeh Mottaghi	FAIR, Meta	Verified email at cs.stanford.edu
Rowan Zellers	OpenAI	Verified email at cs.washington.edu
Christopher Clark	Allen Institute for AI	Verified email at allenai.org
Marcus Rohrbach	Professor for Multimodal Reliable AI, TU Darmstadt, Germany	Verified email at tu-darmstadt.de
Vedanuj Goswami	Research Engineer, Meta AI	Verified email at meta.com
Adam Fisch	Ph.D. student, Massachusetts Institute of Technology	Verified email at mit.edu
Antoine Bordes	Helsing	Verified email at helsing.ai
Chih-Yao Ma	Staff Research Scientist @ GenAI, Meta	Verified email at meta.com
Zuxuan Wu	Fudan University	Verified email at fudan.edu.cn
Peng Gao	Shanghai AI Lab	Verified email at pjlab.org.cn
Yejin Choi	University of Washington / Allen Institute for Artificial Intelligence	Verified email at cs.washington.edu
Jack Hessel	samaya.ai	Verified email at samaya.ai

Jiasen Lu

Research Scientist, Apple

Verified email at apple.com

Computer Vision, Natural Language Processing


Title	Authors	Venue	Cited by	Year
Vqa: Visual question answering	A Agrawal, J Lu, S Antol*, M Mitchell, CL Zitnick, D Parikh, D Batra	International Journal of Computer Vision 123 (1), 4-31, 2017	6006*	2017
Vqa: Visual question answering	S Antol, A Agrawal, J Lu, M Mitchell, D Batra, C Lawrence Zitnick, ...	Proceedings of the IEEE International Conference on Computer Vision, 2425-2433, 2015	5998	2015
Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks	J Lu, D Batra, D Parikh, S Lee	Advances in neural information processing systems, 2019	3587	2019
Hierarchical question-image co-attention for visual question answering	J Lu, J Yang, D Batra, D Parikh	Advances in neural information processing systems 29, 2016	1957	2016
Knowing when to look: Adaptive attention via a visual sentinel for image captioning	J Lu, C Xiong, D Parikh, R Socher	Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017	1779	2017
Graph R-CNN for Scene Graph Generation	J Yang, J Lu, S Lee, D Batra, D Parikh	arXiv preprint arXiv:1808.00191, 2018	941	2018
Neural Baby Talk	J Lu, J Yang, D Batra, D Parikh	In Proceedings of the IEEE conference on computer vision and pattern …, 2018	550	2018
12-in-1: Multi-Task Vision and Language Representation Learning	J Lu, V Goswami, M Rohrbach, D Parikh, S Lee	Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019	518	2019
Parlai: A dialog research software platform	AH Miller, W Feng, A Fisch, J Lu, D Batra, A Bordes, D Parikh, J Weston	arXiv preprint arXiv:1705.06476, 2017	439	2017
Unified-IO: A unified model for vision, language, and multi-modal tasks	J Lu, C Clark, R Zellers, R Mottaghi, A Kembhavi	arXiv preprint arXiv:2206.08916, 2022	320	2022
Self-monitoring navigation agent via auxiliary progress estimation	CY Ma, J Lu, Z Wu, G AlRegib, Z Kira, R Socher, C Xiong	arXiv preprint arXiv:1901.03035, 2019	283	2019
Merlot reserve: Neural script knowledge through vision and language and sound	R Zellers, J Lu, X Lu, Y Yu, Y Zhao, M Salehi, A Kusupati, J Hessel, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	215	2022
Best of both worlds: Transferring knowledge from discriminative learning to a generative visual dialog model	J Lu, A Kannan, J Yang, D Parikh, D Batra	Advances in Neural Information Processing Systems 30, 2017	144	2017
Sentinel gate for modulating auxiliary information in a long short-term memory (lstm) neural network	LU Jiasen, C Xiong, R Socher	US Patent 10,565,306, 2020	141	2020
Multi-modal answer validation for knowledge-based vqa	J Wu, J Lu, A Sabharwal, R Mottaghi	Proceedings of the AAAI conference on artificial intelligence 36 (3), 2712-2721, 2022	118	2022
Adaptive attention model for image captioning	LU Jiasen, C Xiong, R Socher	US Patent 10,565,305, 2020	113	2020
A Faster Pytorch Implementation of Faster R-CNN	J Yang, J Lu, D Batra, D Parikh	https://github.com/jwyang/faster-rcnn.pytorch, 2018	108	2018
X-lxmert: Paint, caption and answer questions with multi-modal transformers	J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi	arXiv preprint arXiv:2009.11278, 2020	104	2020
Spatially aware multimodal transformers for textvqa	Y Kant, D Batra, P Anderson, A Schwing, D Parikh, J Lu, H Agrawal	Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	92	2020
Deeper lstm and normalized cnn visual question answering model	J Lu, X Lin, D Batra, D Parikh	GitHub repository 6, 2015	82	2015
