Shu Zhang
Shu Zhang
Salesforce Inc.
Verified email at
Cited by
Cited by
Heterogeneous memory enhanced multimodal attention model for video question answering
C Fan, X Zhang, S Zhang, W Wang, C Zhang, H Huang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Context-aware surveillance video summarization
S Zhang, Y Zhu, AK Roy-Chowdhury
IEEE Transactions on Image Processing 25 (11), 5469-5478, 2016
A camera network tracking (camnet) dataset and performance baseline
S Zhang, E Staudt, T Faltemier, AK Roy-Chowdhury
2015 IEEE Winter Conference on Applications of Computer Vision, 365-372, 2015
Use all the labels: A hierarchical multi-label contrastive learning framework
S Zhang, R Xu, C Xiong, C Ramaiah
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Tracking multiple interacting targets in a camera network
S Zhang, Y Zhu, A Roy-Chowdhury
Computer Vision and Image Understanding 134, 64-73, 2015
Unicontrol: A unified diffusion model for controllable visual generation in the wild
C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ...
arXiv preprint arXiv:2305.11147, 2023
Hive: Harnessing human feedback for instructional visual editing
S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ...
arXiv preprint arXiv:2303.09618, 2023
Ulip-2: Towards scalable multimodal pre-training for 3d understanding
L Xue, N Yu, S Zhang, J Li, R Martín-Martín, J Wu, C Xiong, R Xu, ...
arXiv preprint arXiv:2305.08275, 2023
Gluegen: Plug and play multi-modal encoders for x-to-image generation
C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Video summarization through change detection in a non-overlapping camera network
S Zhang, AK Roy-Chowdhury
2015 IEEE International Conference on Image Processing (ICIP), 3832-3836, 2015
Online social behavior modeling for multi-target tracking
S Zhang, A Das, C Ding, A Roy-Chowdhury
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013
Adaptive algorithm selection, with applications in pedestrian detection
S Zhang, Q Zhu, A Roy-Chowdhury
2016 IEEE International Conference on Image Processing (ICIP), 3768-3772, 2016
Template-based key-value extraction for inferring OCR key values within form images
S Zhang, C Ramaiah, R Xu, C Xiong
US Patent 11,495,011, 2022
Adaptive algorithm and platform selection for visual detection and tracking
S Zhang, Q Zhu, A Roy-Chowdhury
arXiv preprint arXiv:1605.06597, 2016
Systems and methods for vision-language distribution alignment
S Zhang, LI Junnan, R Xu, C Xiong, C Ramaiah
US Patent App. 17/589,725, 2023
The Plug and Play of Language Models for Text-to-image Generation
C Qin, N Yu, C Xing, S Zhang, S Ermon, Y Fu, C Xiong, R Xu
Systems and methods for hierarchical multi-label contrastive learning
S Zhang, C Ramaiah, C Xiong, R Xu
US Patent App. 17/328,779, 2022
Wide-Area Video Understanding: Tracking, Video Summarization and Algorithm-Platform Co-Design
S Zhang
University of California, Riverside, 2015
The system can't perform the operation now. Try again later.
Articles 1–18