Zejun Li

Cited by

	All	Since 2019
Citations	84	84
h-index	6	6
i10-index	3	3

20212022202320243 18 32 31

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Zhongyu Wei (魏忠钰)Associate Professor at School of Data Science, Fudan UniversityVerified email at fudan.edu.cn
Zhihao FanAlibabaVerified email at alibaba-inc.com
Huang Xuanjing (黄萱菁)Professor of Computer Science, Fudan UniversityVerified email at fudan.edu.cn
Siyuan WangUniversity of Southern CaliforniaVerified email at usc.edu
Jingjing ChenFudan UniversityVerified email at my.cityu.edu.hk
Qi Zhang (张奇)Professor of Computer Science, Fudan UniversityVerified email at fudan.edu.cn
Xinyi MouFudan UniversityVerified email at fudan.edu.cn

Zejun Li

Fudan University

Verified email at fudan.edu.cn

vision-language multi-modality


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Tcic: Theme concepts learning cross language and vision for image captioning Z Fan, Z Wei, S Wang, R Wang, Z Li, H Shan, X Huang arXiv preprint arXiv:2106.10936, 2021	24	2021
Mvp: Multi-stage vision-language pre-training via multi-level semantic alignment Z Li, Z Fan, H Tou, Z Wei arXiv preprint arXiv:2201.12596 1, 2022	13	2022
Mvptr: Multi-level semantic alignment for vision-language pre-training via multi-stage learning Z Li, Z Fan, H Tou, J Chen, Z Wei, X Huang Proceedings of the 30th ACM International Conference on Multimedia, 4395-4405, 2022	12	2022
Constructing phrase-level semantic labels to form multi-grained supervision for image-text retrieval Z Fan, Z Wei, Z Li, S Wang, H Shan, X Huang, J Fan Proceedings of the 2022 International Conference on Multimedia Retrieval …, 2022	8	2022
Unifying cross-lingual and cross-modal modeling towards weakly supervised multilingual vision-language pre-training Z Li, Z Fan, J Chen, Q Zhang, XJ Huang, Z Wei Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	7	2023
Negative sample is negative in its own way: Tailoring negative sentences for image-text retrieval Z Fan, Z Wei, Z Li, S Wang, J Fan arXiv preprint arXiv:2111.03349, 2021	6	2021
A unified continuous learning framework for multi-modal knowledge discovery and pre-training Z Fan, Z Wei, J Chen, S Wang, Z Li, J Xu, X Huang arXiv preprint arXiv:2206.05555, 2022	4	2022
Reform-eval: Evaluating large vision language models via unified re-formulation of task-oriented benchmarks Z Li, Y Wang, M Du, Q Liu, B Wu, J Zhang, C Zhou, Z Fan, J Fu, J Chen, ... arXiv preprint arXiv:2310.02569, 2023	3	2023
An unsupervised sampling approach for image-sentence matching using document-level structural information Z Li, Z Wei, Z Fan, H Shan, X Huang Proceedings of the AAAI Conference on Artificial Intelligence 35 (15), 13324 …, 2021	3	2021
Unifying Local and Global Knowledge: Empowering Large Language Models as Political Experts with Knowledge Graphs X Mou, Z Li, H Lyu, J Luo, Z Wei Proceedings of the ACM on Web Conference 2024, 2603-2614, 2024	2	2024
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models M Du, B Wu, Z Li, X Huang, Z Wei arXiv preprint arXiv:2406.05756, 2024	1	2024
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models Z Li, R Luo, J Zhang, M Qiu, Z Wei arXiv preprint arXiv:2405.16919, 2024	1	2024

The system can't perform the operation now. Try again later.

Articles 1–12

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors