Sihan Chen

Cited by

	All	Since 2019
Citations	343	343
h-index	7	7
i10-index	6	6

Citations per year
2021	16
2022	59
2023	157
2024	108

Public access

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jing Liu 刘静, Professor, Institute of Automation of the Chinese Academy of Sciences (CASIA). Verified email at nlpr.ia.ac.cn
Xinxin Zhu 朱欣鑫, Institute of Automation of the Chinese Academy of Sciences (CASIA). Verified email at nlpr.ia.ac.cn
Longteng Guo, Associate Professor, Institute of Automation of the Chinese Academy of Sciences (CASIA). Verified email at nlpr.ia.ac.cn
Xingjian He, Institute of Automation of the Chinese Academy of Sciences (CASIA). Verified email at nlpr.ia.ac.cn
Zijia Zhao, Institute of Automation, Chinese Academy of Sciences (CASIA). Verified email at ia.ac.cn
Handong Li, Institute of Automation, Chinese Academy of Sciences. Verified email at ia.ac.cn
Xiaojie Jin 靳潇杰, ByteDance Research, USA. Verified email at bytedance.com
Jiashi Feng, ByteDance Inc. Verified email at bytedance.com
Zikang Liu, Institute of Automation, Chinese Academy of Sciences. Verified email at ia.ac.cn
Weining Wang, Institute of Automation, Chinese Academy of Sciences. Verified email at nlpr.ia.ac.cn
Jiawei Liu, ByteDance. Verified email at bytedance.com
Yichen Yan, Chinese Academy of Sciences. Verified email at ia.ac.cn

Sihan Chen

Institute of Automation, Chinese Academy of Sciences

Verified email at nlpr.ia.ac.cn

Vision-Language Pretraining, Multimodal Understanding


Title	Cited by	Year
Cptr: Full transformer network for image captioning. W Liu, S Chen, L Guo, X Zhu, J Liu. arXiv preprint arXiv:2101.10804, 2021	165	2021
Valor: Vision-audio-language omni-perception pretraining model and dataset. S Chen, X He, L Guo, X Zhu, W Wang, J Tang, J Liu. arXiv preprint arXiv:2304.08345, 2023	55	2023
Vast: A vision-audio-subtitle-text omni-modality foundation model and dataset. S Chen, H Li, Q Wang, Z Zhao, M Sun, X Zhu, J Liu. Advances in Neural Information Processing Systems 36, 2024	35	2024
Chatbridge: Bridging modalities with large language model as a language catalyst. Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu. arXiv preprint arXiv:2305.16103, 2023	30	2023
Global-local propagation network for RGB-D semantic segmentation. S Chen, X Zhu, W Liu, X He, J Liu. arXiv preprint arXiv:2101.10801, 2021	19	2021
Vlab: Enhancing video language pre-training by feature adapting and blending. X He, S Chen, F Ma, Z Huang, X Jin, Z Liu, D Fu, Y Yang, J Liu, J Feng. arXiv preprint arXiv:2305.13167, 2023	18	2023
Vl-mamba: Exploring state space models for multimodal learning. Y Qiao, Z Yu, L Guo, S Chen, Z Zhao, M Sun, Q Wu, J Liu. arXiv preprint arXiv:2403.13600, 2024	8	2024
Mm21 pre-training for video understanding challenge: Video captioning with pretraining techniques. S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu. Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021	5	2021
Cosa: Concatenated sample pretrained vision-language foundation model. S Chen, X He, H Li, X Jin, J Feng, J Liu. The Twelfth International Conference on Learning Representations, 2023	3	2023
Sounding video generator: A unified framework for text-guided sounding video generation. J Liu, W Wang, S Chen, X Zhu, J Liu. IEEE Transactions on Multimedia, 2023	3	2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER. M Sun, W Wang, Z Qin, J Sun, S Chen, J Liu. Advances in Neural Information Processing Systems 36, 2024	2	2024
Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation. Y Yan, X He, S Chen, J Liu. arXiv preprint arXiv:2404.08281, 2024		2024
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner. Z Liu, S Chen, L Guo, H Li, X He, J Liu. Proceedings of the 31st ACM International Conference on Multimedia, 5120-5131, 2023		2023
EAVL: Explicitly Align Vision and Language for Referring Image Segmentation. Y Yan, X He, W Wang, S Chen, J Liu. arXiv preprint arXiv:2308.09779, 2023		2023
