Shu Zhang

Cited by

	All	Since 2019
Citations	778	693
h-index	9	9
i10-index	9	9

220

110

165

201420152016201720182019202020212022202320243 12 15 26 28 37 57 86 112 194 206

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Ran XuSalesforce ResearchVerified email at salesforce.com
Caiming XiongSalesforce ResearchVerified email at salesforce.com
Amit K. Roy-ChowdhuryProfessor and Bourns Family Faculty Fellow, UC Riverside; Fellow IEEE, IAPRVerified email at ece.ucr.edu
Ning YuNetflix Eyeline StudiosVerified email at scanlinevfx.com
Can QinSalesforceVerified email at husky.neu.edu
Yingying ZhuGoogle Inc.Verified email at ieee.org
Chenyou FanSouth China Normal University, Indiana University BloomingtonVerified email at m.scnu.edu.cn
Yihao FengSalesforce AI ResearchVerified email at salesforce.com
Qi ZhuAssociate Professor of Computer EngineeringVerified email at northwestern.edu
Abir DasAssistant Professor at IIT KharagpurVerified email at cse.iitkgp.ac.in

Shu Zhang

Salesforce Inc.

Verified email at salesforce.com

computer vision image generation 3D understanding multi-modal


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Heterogeneous memory enhanced multimodal attention model for video question answering C Fan, X Zhang, S Zhang, W Wang, C Zhang, H Huang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	300	2019
Context-aware surveillance video summarization S Zhang, Y Zhu, AK Roy-Chowdhury IEEE Transactions on Image Processing 25 (11), 5469-5478, 2016	97	2016
A camera network tracking (camnet) dataset and performance baseline S Zhang, E Staudt, T Faltemier, AK Roy-Chowdhury 2015 IEEE winter conference on applications of computer vision, 365-372, 2015	77	2015
Ulip-2: Towards scalable multimodal pre-training for 3d understanding L Xue, N Yu, S Zhang, A Panagopoulou, J Li, R Martín-Martín, J Wu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	61	2024
Use all the labels: A hierarchical multi-label contrastive learning framework S Zhang, R Xu, C Xiong, C Ramaiah Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	54	2022
Hive: Harnessing human feedback for instructional visual editing S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	51	2024
Unicontrol: A unified diffusion model for controllable visual generation in the wild C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ... arXiv preprint arXiv:2305.11147, 2023	51	2023
Tracking multiple interacting targets in a camera network S Zhang, Y Zhu, A Roy-Chowdhury Computer Vision and Image Understanding 134, 64-73, 2015	49	2015
Gluegen: Plug and play multi-modal encoders for x-to-image generation C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	15	2023
Video summarization through change detection in a non-overlapping camera network S Zhang, AK Roy-Chowdhury 2015 IEEE International Conference on Image Processing (ICIP), 3832-3836, 2015	9	2015
Online social behavior modeling for multi-target tracking S Zhang, A Das, C Ding, A Roy-Chowdhury Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013	7	2013
Adaptive algorithm selection, with applications in pedestrian detection S Zhang, Q Zhu, A Roy-Chowdhury 2016 IEEE International Conference on Image Processing (ICIP), 3768-3772, 2016	5	2016
Template-based key-value extraction for inferring OCR key values within form images S Zhang, C Ramaiah, R Xu, C Xiong US Patent 11,495,011, 2022	1	2022
Adaptive algorithm and platform selection for visual detection and tracking S Zhang, Q Zhu, A Roy-Chowdhury arXiv preprint arXiv:1605.06597, 2016	1	2016
Systems and methods for text-to-image generation using language models N Yu, C Qin, C Xing, S Zhang, S Ermon, C Xiong, R Xu US Patent App. 18/162,535, 2024		2024
Systems and methods for vision-language distribution alignment S Zhang, LI Junnan, R Xu, C Xiong, C Ramaiah US Patent App. 17/589,725, 2023		2023
Systems and methods for hierarchical multi-label contrastive learning S Zhang, C Ramaiah, C Xiong, R Xu US Patent App. 17/328,779, 2022		2022
Wide-Area Video Understanding: Tracking, Video Summarization and Algorithm-Platform Co-Design S Zhang University of California, Riverside, 2015		2015
The Plug and Play of Language Models for Text-to-image Generation C Qin, N Yu, C Xing, S Zhang, S Ermon, Y Fu, C Xiong, R Xu

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors