
OpenAlex is an openly accessible bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!
Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
Vision-Language Pre-Training with Triple Contrastive Learning
Jinyu Yang, Jiali Duan, Son N. Tran, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 15650-15659
Open Access | Times Cited: 182
Showing 1-25 of 182 citing articles:
Multimodal Learning With Transformers: A Survey
Peng Xu, Xiatian Zhu, David A. Clifton
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 45, Iss. 10, pp. 12113-12132
Open Access | Times Cited: 338
VLP: A Survey on Vision-language Pre-training
Feilong Chen, Duzhen Zhang, Minglun Han, et al.
Deleted Journal (2023) Vol. 20, Iss. 1, pp. 38-56
Open Access | Times Cited: 128
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey
Xiao Wang, Guangyao Chen, Guangwu Qian, et al.
Deleted Journal (2023) Vol. 20, Iss. 4, pp. 447-482
Open Access | Times Cited: 92
A Survey of Vision-Language Pre-Trained Models
Yifan Du, Zikang Liu, Junyi Li, et al.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (2022), pp. 5436-5443
Open Access | Times Cited: 73
MixGen: A New Multi-Modal Data Augmentation
Xiaoshuai Hao, Yi Zhu, Srikar Appalaraju, et al.
(2023)
Open Access | Times Cited: 55
Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
Ming Lu, Bowen Chen, Andrew Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 19764-19775
Open Access | Times Cited: 44
Fine-grained Image-text Matching by Cross-modal Hard Aligning Network
Zhengxin Pan, Fangyu Wu, Bailing Zhang
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Closed Access | Times Cited: 39
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Zixian Ma, Jerry Hong, Mustafa Omer Gul, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023) Vol. 33, pp. 10910-10921
Open Access | Times Cited: 39
RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search
Yang Bai, Min Cao, Daming Gao, et al.
(2023), pp. 555-563
Open Access | Times Cited: 31
Detecting and Grounding Multi-Modal Media Manipulation
Rui Shao, Tianxing Wu, Ziwei Liu
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 29
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
Vishaal Udandarao, Ankush Gupta, Samuel Albanie
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2725-2736
Open Access | Times Cited: 26
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Filip Radenović, Abhimanyu Dubey, Abhishek Kadian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 6967-6977
Open Access | Times Cited: 24
Teaching Structured Vision & Language Concepts to Vision & Language Models
Sivan Doveh, Assaf Arbelle, Sivan Harary, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 2657-2668
Closed Access | Times Cited: 23
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
Chaoya Jiang, Haiyang Xu, Mengfan Dong, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 27026-27036
Closed Access | Times Cited: 11
Exploring scalable medical image encoders beyond text supervision
Fernando Pérez‐García, Harshita Sharma, Sam Bond-Taylor, et al.
Nature Machine Intelligence (2025)
Closed Access | Times Cited: 1
Multi-modal Alignment using Representation Codebook
Jiali Duan, Li‐Qun Chen, Son N. Tran, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 15630-15639
Open Access | Times Cited: 36
Towards Adversarial Attack on Vision-Language Pre-training Models
Jiaming Zhang, Qi Yi, Jitao Sang
Proceedings of the 30th ACM International Conference on Multimedia (2022)
Open Access | Times Cited: 33
Context-aware Alignment and Mutual Masking for 3D-Language Pre-training
Jin Zhao, Munawar Hayat, Yuwei Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10984-10994
Closed Access | Times Cited: 21
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization
Junhyeong Cho, Gilhyun Nam, Sungyeon Kim, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Open Access | Times Cited: 20
Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning
Qian Jiang, Changyou Chen, Han Zhao, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 7661-7671
Open Access | Times Cited: 17
HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning
Chia-Wen Kuo, Zsolt Kira
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 11039-11049
Open Access | Times Cited: 17
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
Yuxiao Chen, Jianbo Yuan, Yu Tian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 16
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge
Wei Lin, Leonid Karlinsky, Nina Shvetsova, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2839-2850
Open Access | Times Cited: 16
Detecting and Grounding Multi-Modal Media Manipulation and Beyond
Rui Shao, Tianxing Wu, Jianlong Wu, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5556-5574
Open Access | Times Cited: 7
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
Hailang Huang, Zhijie Nie, Ziqiao Wang, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 16, pp. 18298-18306
Open Access | Times Cited: 6