Follow
Xiuye Gu
Xiuye Gu
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Open-vocabulary object detection via vision and language knowledge distillation
X Gu, TY Lin, W Kuo, Y Cui
International Conference on Learning Representations (ICLR), 2022
6342022
Scaling open-vocabulary image segmentation with image-level labels
G Ghiasi, X Gu, Y Cui, TY Lin
European Conference on Computer Vision, 540-557, 2022
2462022
Hplflownet: Hierarchical permutohedral lattice flownet for scene flow estimation on large-scale point clouds
X Gu, Y Wang, C Wu, YJ Lee, P Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
2152019
F-vlm: Open-vocabulary object detection upon frozen vision and language models
W Kuo, Y Cui, X Gu, AJ Piergiovanni, A Angelova
arXiv preprint arXiv:2209.15639, 2022
932022
Interspecies knowledge transfer for facial keypoint detection
M Rashid, X Gu, Y Jae Lee
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
572017
Password-conditioned Anonymization and Deanonymization with Face Identity Transformers
X Gu, W Luo, MS Ryoo, YJ Lee
European Conference on Computer Vision (ECCV), 2020
312020
Videopoet: A large language model for zero-shot video generation
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, R Hornung, H Adam, ...
arXiv preprint arXiv:2312.14125, 2023
222023
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation
L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ...
arXiv preprint arXiv:2310.05737, 2023
192023
A revisit on deep hashings for large-scale content based image retrieval
D Cai, X Gu, C Wang
arXiv preprint arXiv:1711.06016, 2017
162017
Photorealistic video generation with diffusion models
A Gupta, L Yu, K Sohn, X Gu, M Hahn, L Fei-Fei, I Essa, L Jiang, ...
arXiv preprint arXiv:2312.06662, 2023
152023
Dataseg: Taming a universal multi-dataset multi-task segmentation model
X Gu, Y Cui, J Huang, A Rashwan, X Yang, X Zhou, G Ghiasi, W Kuo, ...
Advances in Neural Information Processing Systems 36, 2024
102024
A simple zero-shot prompt weighting technique to improve prompt ensembling in text-image models
JU Allingham, J Ren, MW Dusenberry, X Gu, Y Cui, D Tran, JZ Liu, ...
International Conference on Machine Learning, 547-568, 2023
102023
Pixel aligned language models
J Xu, X Zhou, S Yan, X Gu, A Arnab, C Sun, X Wang, C Schmid
arXiv preprint arXiv:2312.09237, 2023
42023
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
S Sun, R Li, P Torr, X Gu, S Li
arXiv preprint arXiv:2312.07661, 2023
22023
PolyMaX: General Dense Prediction with Mask Transformer
X Yang, L Yuan, K Wilber, A Sharma, X Gu, S Qiao, S Debats, H Wang, ...
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
12024
Explore deep graph generation
X Gu
12019
Human or robot
X Gu, S Shi
12017
Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features
V Rathod, B Seybold, S Vijayanarasimhan, A Myers, X Gu, V Birodkar, ...
arXiv preprint arXiv:2212.10596, 2022
2022
Supplemental material for Depth Reconstruction from Stereo Image Pairs
X Gu
2015
Depth Reconstruction from Stereo Image Pairs
X Gu
The system can't perform the operation now. Try again later.
Articles 1–20