
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
FILIP: Fine-grained Interactive Language-Image Pre-Training
Lewei Yao, Runhui Huang, Lu Hou, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 160
Lewei Yao, Runhui Huang, Lu Hou, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 160
Showing 1-25 of 160 citing articles:
Multimodal Learning With Transformers: A Survey
Peng Xu, Xiatian Zhu, David A. Clifton
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 45, Iss. 10, pp. 12113-12132
Open Access | Times Cited: 338
Peng Xu, Xiatian Zhu, David A. Clifton
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 45, Iss. 10, pp. 12113-12132
Open Access | Times Cited: 338
MaPLe: Multi-modal Prompt Learning
Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 19113-19122
Open Access | Times Cited: 276
Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 19113-19122
Open Access | Times Cited: 276
Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks
Wenhui Wang, Hangbo Bao, Dong Li, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 19175-19186
Closed Access | Times Cited: 260
Wenhui Wang, Hangbo Bao, Dong Li, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 19175-19186
Closed Access | Times Cited: 260
Multi-Modal Knowledge Graph Construction and Application: A Survey
Xiangru Zhu, Zhixu Li, Xiaodan Wang, et al.
IEEE Transactions on Knowledge and Data Engineering (2022) Vol. 36, Iss. 2, pp. 715-735
Open Access | Times Cited: 132
Xiangru Zhu, Zhixu Li, Xiaodan Wang, et al.
IEEE Transactions on Knowledge and Data Engineering (2022) Vol. 36, Iss. 2, pp. 715-735
Open Access | Times Cited: 132
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
Chenliang Li, Haiyang Xu, Junfeng Tian, et al.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (2022)
Open Access | Times Cited: 93
Chenliang Li, Haiyang Xu, Junfeng Tian, et al.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (2022)
Open Access | Times Cited: 93
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Xiaoyi Dong, Jianmin Bao, Yinglin Zheng, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10995-11005
Open Access | Times Cited: 71
Xiaoyi Dong, Jianmin Bao, Yinglin Zheng, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10995-11005
Open Access | Times Cited: 71
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-Training
Tianyu Huang, Bowen Dong, Yunhan Yang, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 22100-22110
Open Access | Times Cited: 57
Tianyu Huang, Bowen Dong, Yunhan Yang, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 22100-22110
Open Access | Times Cited: 57
Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing
Shruthi Bannur, Stephanie L. Hyland, Qianchu Liu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 15016-15027
Open Access | Times Cited: 54
Shruthi Bannur, Stephanie L. Hyland, Qianchu Liu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 15016-15027
Open Access | Times Cited: 54
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Wenhao Wu, Haipeng Luo, Bo Fang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10704-10713
Open Access | Times Cited: 46
Wenhao Wu, Haipeng Luo, Bo Fang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10704-10713
Open Access | Times Cited: 46
Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning
AJ Piergiovanni, Weicheng Kuo, Anelia Angelova
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 43
AJ Piergiovanni, Weicheng Kuo, Anelia Angelova
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 43
The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis
Manuele Barraco, Marcella Cornia, Silvia Cascianelli, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2022)
Open Access | Times Cited: 54
Manuele Barraco, Marcella Cornia, Silvia Cascianelli, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2022)
Open Access | Times Cited: 54
Image-text Retrieval: A Survey on Recent Research and Development
Min Cao, Shiping Li, Juntao Li, et al.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (2022), pp. 5410-5417
Open Access | Times Cited: 47
Min Cao, Shiping Li, Juntao Li, et al.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (2022), pp. 5410-5417
Open Access | Times Cited: 47
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
Wenliang Dai, Lu Hou, Lifeng Shang, et al.
Findings of the Association for Computational Linguistics: ACL 2022 (2022)
Open Access | Times Cited: 40
Wenliang Dai, Lu Hou, Lifeng Shang, et al.
Findings of the Association for Computational Linguistics: ACL 2022 (2022)
Open Access | Times Cited: 40
Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision
Jilan Xu, Junlin Hou, Yuejie Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 39
Jilan Xu, Junlin Hou, Yuejie Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 39
Self-regulating Prompts: Foundational Model Adaptation without Forgetting
Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15144-15154
Open Access | Times Cited: 39
Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15144-15154
Open Access | Times Cited: 39
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Muhammad Ferjad Naeem, Muhammad Gul Zain Ali Khan, Yongqin Xian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 34
Muhammad Ferjad Naeem, Muhammad Gul Zain Ali Khan, Yongqin Xian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 34
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Lewei Yao, Jianhua Han, Xiaodan Liang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23497-23506
Open Access | Times Cited: 33
Lewei Yao, Jianhua Han, Xiaodan Liang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23497-23506
Open Access | Times Cited: 33
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
Vishaal Udandarao, Ankush Gupta, Samuel Albanie
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2725-2736
Open Access | Times Cited: 26
Vishaal Udandarao, Ankush Gupta, Samuel Albanie
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2725-2736
Open Access | Times Cited: 26
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Filip Radenović, Abhimanyu Dubey, Abhishek Kadian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 6967-6977
Open Access | Times Cited: 24
Filip Radenović, Abhimanyu Dubey, Abhishek Kadian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 6967-6977
Open Access | Times Cited: 24
Teaching Structured Vision & Language Concepts to Vision & Language Models
Sivan Doveh, Assaf Arbelle, Sivan Harary, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 2657-2668
Closed Access | Times Cited: 23
Sivan Doveh, Assaf Arbelle, Sivan Harary, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 2657-2668
Closed Access | Times Cited: 23
Texts as Images in Prompt Tuning for Multi-Label Image Recognition
Zixian Guo, Bowen Dong, Zhilong Ji, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 2808-2817
Open Access | Times Cited: 23
Zixian Guo, Bowen Dong, Zhilong Ji, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 2808-2817
Open Access | Times Cited: 23
X2-VLM: All-in-One Pre-Trained Model for Vision-Language Tasks
Yan Zeng, Xinsong Zhang, Hang Li, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 46, Iss. 5, pp. 3156-3168
Open Access | Times Cited: 22
Yan Zeng, Xinsong Zhang, Hang Li, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 46, Iss. 5, pp. 3156-3168
Open Access | Times Cited: 22
An Empirical Study of CLIP for Text-Based Person Search
Min Cao, Yang Bai, Ziyin Zeng, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 1, pp. 465-473
Open Access | Times Cited: 11
Min Cao, Yang Bai, Ziyin Zeng, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 1, pp. 465-473
Open Access | Times Cited: 11
SoftCLIP: Softer Cross-Modal Alignment Makes CLIP Stronger
Yuting Gao, Jinfeng Liu, Zihan Xu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 3, pp. 1860-1868
Open Access | Times Cited: 11
Yuting Gao, Jinfeng Liu, Zihan Xu, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 3, pp. 1860-1868
Open Access | Times Cited: 11
Delving into Multimodal Prompting for Fine-Grained Visual Classification
Xin Jiang, Hao Tang, Junyao Gao, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 3, pp. 2570-2578
Open Access | Times Cited: 10
Xin Jiang, Hao Tang, Junyao Gao, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 3, pp. 2570-2578
Open Access | Times Cited: 10