Heterogeneous memory enhanced multimodal attention model for video question answering C Fan, X Zhang, S Zhang, W Wang, C Zhang, H Huang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 285 | 2019 |
Context-aware surveillance video summarization S Zhang, Y Zhu, AK Roy-Chowdhury IEEE Transactions on Image Processing 25 (11), 5469-5478, 2016 | 96 | 2016 |
A camera network tracking (camnet) dataset and performance baseline S Zhang, E Staudt, T Faltemier, AK Roy-Chowdhury 2015 IEEE Winter Conference on Applications of Computer Vision, 365-372, 2015 | 79 | 2015 |
Use all the labels: A hierarchical multi-label contrastive learning framework S Zhang, R Xu, C Xiong, C Ramaiah Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 49 | 2022 |
Tracking multiple interacting targets in a camera network S Zhang, Y Zhu, A Roy-Chowdhury Computer Vision and Image Understanding 134, 64-73, 2015 | 48 | 2015 |
Unicontrol: A unified diffusion model for controllable visual generation in the wild C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ... arXiv preprint arXiv:2305.11147, 2023 | 42 | 2023 |
Ulip-2: Towards scalable multimodal pre-training for 3d understanding L Xue, N Yu, S Zhang, J Li, R Martín-Martín, J Wu, C Xiong, R Xu, ... arXiv preprint arXiv:2305.08275, 2023 | 38 | 2023 |
Hive: Harnessing human feedback for instructional visual editing S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ... arXiv preprint arXiv:2303.09618, 2023 | 35 | 2023 |
Gluegen: Plug and play multi-modal encoders for x-to-image generation C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 14 | 2023 |
Video summarization through change detection in a non-overlapping camera network S Zhang, AK Roy-Chowdhury 2015 IEEE International Conference on Image Processing (ICIP), 3832-3836, 2015 | 9 | 2015 |
Online social behavior modeling for multi-target tracking S Zhang, A Das, C Ding, A Roy-Chowdhury Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013 | 7 | 2013 |
Adaptive algorithm selection, with applications in pedestrian detection S Zhang, Q Zhu, A Roy-Chowdhury 2016 IEEE International Conference on Image Processing (ICIP), 3768-3772, 2016 | 5 | 2016 |
Template-based key-value extraction for inferring OCR key values within form images S Zhang, C Ramaiah, R Xu, C Xiong US Patent 11,495,011, 2022 | 1 | 2022 |
Adaptive algorithm and platform selection for visual detection and tracking S Zhang, Q Zhu, A Roy-Chowdhury arXiv preprint arXiv:1605.06597, 2016 | 1 | 2016 |
Systems and methods for vision-language distribution alignment S Zhang, LI Junnan, R Xu, C Xiong, C Ramaiah US Patent App. 17/589,725, 2023 | | 2023 |
The Plug and Play of Language Models for Text-to-image Generation C Qin, N Yu, C Xing, S Zhang, S Ermon, Y Fu, C Xiong, R Xu | | 2022 |
Systems and methods for hierarchical multi-label contrastive learning S Zhang, C Ramaiah, C Xiong, R Xu US Patent App. 17/328,779, 2022 | | 2022 |
Wide-Area Video Understanding: Tracking, Video Summarization and Algorithm-Platform Co-Design S Zhang University of California, Riverside, 2015 | | 2015 |