
OpenAlex is an openly accessible bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
Unified Contrastive Learning in Image-Text-Label Space
Jianwei Yang, Chunyuan Li, Pengchuan Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 19141-19151
Open Access | Times Cited: 124
Showing 1-25 of 124 citing articles:
TEMOS: Generating Diverse Human Motions from Textual Descriptions
Mathis Petrovich, Michael J. Black, Gül Varol
Lecture notes in computer science (2022), pp. 480-497
Closed Access | Times Cited: 176
DaViT: Dual Attention Vision Transformers
Mingyu Ding, Bin Xiao, Noel Codella, et al.
Lecture notes in computer science (2022), pp. 74-92
Closed Access | Times Cited: 172
Vision-Language Models for Vision Tasks: A Survey
J Zhang, Jiaxing Huang, Sheng Jin, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5625-5644
Open Access | Times Cited: 111
Generalized Decoding for Pixel, Image, and Language
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 15116-15127
Open Access | Times Cited: 95
WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation
Jongheon Jeong, Yang Zou, Taewan Kim, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 77
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai, Basil Mustafa, А. И. Колесников, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Open Access | Times Cited: 76
Aligning Bag of Regions for Open-Vocabulary Object Detection
Size Wu, Wenwei Zhang, Sheng Jin, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 15254-15264
Open Access | Times Cited: 52
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
Wenhao Wu, Zhun Sun, Wanli Ouyang
Proceedings of the AAAI Conference on Artificial Intelligence (2023) Vol. 37, Iss. 3, pp. 2847-2855
Open Access | Times Cited: 51
A Simple Framework for Open-Vocabulary Segmentation and Detection
Hao Zhang, Feng Li, Xueyan Zou, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 1020-1031
Open Access | Times Cited: 51
Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
Ming Lu, Bowen Chen, Andrew Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 19764-19775
Open Access | Times Cited: 44
VindLU: A Recipe for Effective Video-and-Language Pretraining
Feng Cheng, Xizi Wang, Jie Lei, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 38
CLIP-guided Prototype Modulating for Few-shot Action Recognition
Xiang Wang, Shiwei Zhang, Jun Cen, et al.
International Journal of Computer Vision (2023) Vol. 132, Iss. 6, pp. 1899-1912
Closed Access | Times Cited: 26
Generative Action Description Prompts for Skeleton-based Action Recognition
Wangmeng Xiang, Chao Li, Yuxuan Zhou, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 10242-10251
Open Access | Times Cited: 25
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
Filip Radenović, Abhimanyu Dubey, Abhishek Kadian, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 6967-6977
Open Access | Times Cited: 24
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick, Yale Song, Sayan Nag, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5262-5274
Open Access | Times Cited: 23
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng, Binxin Yang, Tiankai Hang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 12709-12720
Closed Access | Times Cited: 12
A Foundation Language-Image Model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Julio Silva-Rodríguez, Hadi Chakor, Riadh Kobbi, et al.
Medical Image Analysis (2024) Vol. 99, pp. 103357-103357
Closed Access | Times Cited: 9
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Hao Li, Jinguo Zhu, Xiaohu Jiang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 20
Best of Both Worlds: Multimodal Contrastive Learning with Tabular and Imaging Data
Paul Hager, Martin J. Menten, Daniel Rueckert
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 20
CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training
Kihyun You, Jawook Gu, Jiyeon Ham, et al.
Lecture notes in computer science (2023), pp. 101-111
Closed Access | Times Cited: 18
TOMGPT: Reliable Text-Only Training Approach for Cost-Effective Multi-modal Large Language Model
Yunkai Chen, Qimeng Wang, Shiwei Wu, et al.
ACM Transactions on Knowledge Discovery from Data (2024) Vol. 18, Iss. 7, pp. 1-19
Open Access | Times Cited: 6
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection
Thinh Phan, Viet-Khoa Vo-Ho, Duy Le, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 7031-7040
Open Access | Times Cited: 6
Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
Kashu Yamazaki, Taisei Hanyu, Khoa Vo, et al.
(2024), pp. 9411-9417
Closed Access | Times Cited: 6
Hierarchical Prompt Learning for Multi-Task Learning
Yajing Liu, Yuning Lu, Hao Liu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Closed Access | Times Cited: 14
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Zeyi Huang, Andy Zhou, Zijian Lin, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 11651-11661
Open Access | Times Cited: 14