OpenAlex Citation Counts

OpenAlex is an openly accessible bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you find this listing of citing articles useful!

Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this same listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.

Requested Article:

RegionCLIP: Region-based Language-Image Pretraining
Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 16772-16782
Open Access | Times Cited: 281

Showing 26-50 of 281 citing articles:

Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from Only Image-Text Pairs
Junbum Cha, Jonghwan Mun, Byungseok Roh
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023) Vol. 24, pp. 11165-11174
Open Access | Times Cited: 34

DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Lewei Yao, Jianhua Han, Xiaodan Liang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23497-23506
Open Access | Times Cited: 33

CLIP-guided Prototype Modulating for Few-shot Action Recognition
Xiang Wang, Shiwei Zhang, Jun Cen, et al.
International Journal of Computer Vision (2023) Vol. 132, Iss. 6, pp. 1899-1912
Closed Access | Times Cited: 27

OVTrack: Open-Vocabulary Multiple Object Tracking
Siyuan Li, Tobias Fischer, Ke Lei, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 5567-5577
Open Access | Times Cited: 25

CLIP-Count: Towards Text-Guided Zero-Shot Object Counting
Ruixiang Jiang, Lingbo Liu, Chang Wen Chen
(2023), pp. 4535-4545
Open Access | Times Cited: 23

AvatarCraft: Transforming Text into Neural Human Avatars with Parameterized Shape and Pose Control
Ruixiang Jiang, Can Wang, Jingbo Zhang, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 14325-14336
Open Access | Times Cited: 23

Alpha-CLIP: A CLIP Model Focusing on Wherever you Want
Zeyi Sun, Fang Ye, Tong Wu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 13019-13029
Closed Access | Times Cited: 14

Advances and Challenges in Deep Learning-Based Change Detection for Remote Sensing Images: A Review through Various Learning Paradigms
Lukang Wang, Min Zhang, Xu Gao, et al.
Remote Sensing (2024) Vol. 16, Iss. 5, pp. 804-804
Open Access | Times Cited: 13

Unified Open-Vocabulary Dense Visual Prediction
Hengcan Shi, Munawar Hayat, Jianfei Cai
IEEE Transactions on Multimedia (2024) Vol. 26, pp. 8704-8716
Open Access | Times Cited: 13

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie, Wei Li, Xiangtai Li, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 12

Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
Phuc Nguyen, Tuan Ngo, Evangelos Kalogerakis, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 4018-4028
Closed Access | Times Cited: 10

Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
Chaoqin Huang, Aofan Jiang, Jinghao Feng, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 54, pp. 11375-11385
Closed Access | Times Cited: 9

Zero-Shot Temporal Action Detection via Vision-Language Prompting
Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, et al.
Lecture notes in computer science (2022), pp. 681-697
Closed Access | Times Cited: 31

Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
Haoxuan You, Luowei Zhou, Bin Xiao, et al.
Lecture notes in computer science (2022), pp. 69-87
Closed Access | Times Cited: 28

OvarNet: Towards Open-Vocabulary Object Attribute Recognition
Keyan Chen, Xiaolong Jiang, Yao Hu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23518-23527
Open Access | Times Cited: 21

Position-Guided Text Prompt for Vision-Language Pre-Training
Jinpeng Wang, Pan Zhou, Mike Zheng Shou, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 19

Going Denser with Open-Vocabulary Part Segmentation
Peize Sun, Shoufa Chen, Chenchen Zhu, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15407-15419
Open Access | Times Cited: 19

EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment
Cheng Shi, Sibei Yang
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15678-15688
Open Access | Times Cited: 19

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
Xiangyang Zhu, Renrui Zhang, Bowei He, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2605-2615
Open Access | Times Cited: 16

Re-scoring using image-language similarity for few-shot object detection
Min Jae Jung, Seung Dae Han, Joohee Kim
Computer Vision and Image Understanding (2024) Vol. 241, pp. 103956-103956
Open Access | Times Cited: 7

Object-centric Video Representation for Long-term Action Anticipation
Ce Zhang, Changcheng Fu, Shijie Wang, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 6737-6747
Open Access | Times Cited: 6

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Kashu Yamazaki, Taisei Hanyu, Khoa Vo, et al.
(2024), pp. 9411-9417
Closed Access | Times Cited: 6

Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Yuiga Wada, Kanta Kaneda, Daichi Saito, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 37, pp. 13559-13568
Closed Access | Times Cited: 6

RegionGPT: Towards Region Understanding Vision Language Model
Qiushan Guo, Shalini De Mello, Hongxu Yin, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 13796-13806
Closed Access | Times Cited: 6
