OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

Scaling Language-Image Pre-Training via Masking
Yanghao Li, Haoqi Fan, Ronghang Hu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 125

Showing 1-25 of 125 citing articles:

A visual-language foundation model for computational pathology
Ming Lu, Bowen Chen, Drew F. K. Williamson, et al.
Nature Medicine (2024) Vol. 30, Iss. 3, pp. 863-874
Open Access | Times Cited: 108

Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai, Basil Mustafa, А. И. Колесников, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Open Access | Times Cited: 72

Knowledge Graph Self-Supervised Rationalization for Recommendation
Yuhao Yang, Chao Huang, Lianghao Xia, et al.
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2023), pp. 3046-3056
Open Access | Times Cited: 70

RemoteCLIP: A Vision Language Foundation Model for Remote Sensing
Fan Liu, Delong Chen, Zhangqingyun Guan, et al.
IEEE Transactions on Geoscience and Remote Sensing (2024) Vol. 62, pp. 1-16
Open Access | Times Cited: 63

Transformer-Based Visual Segmentation: A Survey
Xiangtai Li, Henghui Ding, Haobo Yuan, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 12, pp. 10138-10163
Open Access | Times Cited: 62

Towards Open Vocabulary Learning: A Survey
Jianzong Wu, Xiangtai Li, Shilin Xu, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 7, pp. 5092-5113
Open Access | Times Cited: 42

Foundation models in robotics: Applications, challenges, and the future
Roya Firoozi, Johnathan Tucker, Stephen Tian, et al.
The International Journal of Robotics Research (2024)
Closed Access | Times Cited: 21

A survey on self-supervised learning for non-sequential tabular data
Wei‐Yao Wang, Weiwei Du, Derek Xu, et al.
Machine Learning (2025) Vol. 114, Iss. 1
Closed Access | Times Cited: 1

A Survey on Efficient Training of Transformers
Bohan Zhuang, Jing Liu, Zizheng Pan, et al.
(2023), pp. 6823-6831
Open Access | Times Cited: 27

M-FLAG: Medical Vision-Language Pre-training with Frozen Language Models and Latent Space Geometry Optimization
Che Liu, Sibo Cheng, Chen Chen, et al.
Lecture notes in computer science (2023), pp. 637-647
Closed Access | Times Cited: 25

EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick, Yale Song, Sayan Nag, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5262-5274
Open Access | Times Cited: 23

Alpha-CLIP: A CLIP Model Focusing on Wherever you Want
Zeyi Sun, Fang Ye, Tong Wu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 13019-13029
Closed Access | Times Cited: 14

Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively
Haobo Yuan, Xiangtai Li, Chong Zhou, et al.
Lecture notes in computer science (2024), pp. 419-437
Closed Access | Times Cited: 11

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
Xiangyang Zhu, Renrui Zhang, Bowei He, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2605-2615
Open Access | Times Cited: 16

SD-DiT: Unleashing the Power of Self-Supervised Discrimination in Diffusion Transformer*
Rui Zhu, Yingwei Pan, Yehao Li, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 23, pp. 8435-8445
Closed Access | Times Cited: 6

Transcriptomics-Guided Slide Representation Learning in Computational Pathology
Guillaume Jaume, Lukas Oldenburg, Anurag Vaidya, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 9632-9644
Closed Access | Times Cited: 6

A novel domain adaptation method with physical constraints for shale gas production forecasting
Liangjie Gou, Zhaozhong Yang, Chao Min, et al.
Applied Energy (2024) Vol. 371, pp. 123673-123673
Closed Access | Times Cited: 5

Perceptual Grouping in Contrastive Vision-Language Models
Kanchana Ranasinghe, Brandon McKinzie, Sachin Ravi, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5548-5561
Open Access | Times Cited: 14

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
Kan Wu, Houwen Peng, Zhenghong Zhou, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 21913-21923
Open Access | Times Cited: 13

Mix-tower: Light visual question answering framework based on exclusive self-attention mechanism
Deguang Chen, Jianrui Chen, Luheng Yang, et al.
Neurocomputing (2024) Vol. 587, pp. 127686-127686
Closed Access | Times Cited: 5

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Jack Urbanek, Florian Bordes, Pietro Astolfi, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. abs/2201.12086, pp. 26690-26699
Closed Access | Times Cited: 5

CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun, Runjia Li, Philip H. S. Torr, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 32, pp. 13171-13182
Closed Access | Times Cited: 5

A Multimodal Biomedical Foundation Model Trained from Fifteen Million Image–Text Pairs
Sheng Zhang, Yanbo Xu, Naoto Usuyama, et al.
NEJM AI (2024) Vol. 2, Iss. 1
Closed Access | Times Cited: 5

BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning
Ruyang Liu, Chen Li, Yixiao Ge, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 13658-13667
Closed Access | Times Cited: 4

Fine-Grained Visual Text Prompting
Lingfeng Yang, Xiang Li, Yueze Wang, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 47, Iss. 3, pp. 1594-1609
Closed Access | Times Cited: 4

Page 1 - Next Page

Scroll to top