OpenAlex Citation Counts

OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!

Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this same listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.

Requested Article:

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen, Han Guo, Kai Yi, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 18009-18019
Open Access | Times Cited: 100

Showing 1-25 of 100 citing articles:

The Power of Generative AI: A Review of Requirements, Models, Input–Output Formats, Evaluation Metrics, and Challenges
Ajay Bandi, Pydi Venkata Satya Ramesh Adapa, Yudu Eswar Vinay Pratap Kumar Kuchi
Future Internet (2023) Vol. 15, Iss. 8, pp. 260-260
Open Access | Times Cited: 216

Unleashing the Power of Edge-Cloud Generative AI in Mobile Networks: A Survey of AIGC Services
Minrui Xu, Hongyang Du, Dusit Niyato, et al.
IEEE Communications Surveys & Tutorials (2024) Vol. 26, Iss. 2, pp. 1127-1170
Open Access | Times Cited: 90

Medical image captioning via generative pretrained transformers
Alexander Selivanov, Oleg Y. Rogov, Daniil Chesakov, et al.
Scientific Reports (2023) Vol. 13, Iss. 1
Open Access | Times Cited: 41

A Survey of Large Language Models for Healthcare: From Data, Technology, and Applications to Accountability and Ethics
Kai He, Rui Mao, Qika Lin, et al.
(2024)
Open Access | Times Cited: 39

Pre-Trained Language Models for Text Generation: A Survey
Junyi Li, Tianyi Tang, Wayne Xin Zhao, et al.
ACM Computing Surveys (2024) Vol. 56, Iss. 9, pp. 1-39
Open Access | Times Cited: 37

Vision-Language Models in Remote Sensing: Current progress and future trends
Li Xiang, Congcong Wen, Yuan Hu, et al.
IEEE Geoscience and Remote Sensing Magazine (2024) Vol. 12, Iss. 2, pp. 32-66
Open Access | Times Cited: 25

Foundations & Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions
Paul Pu Liang, Amir Zadeh, Louis‐Philippe Morency
ACM Computing Surveys (2024) Vol. 56, Iss. 10, pp. 1-42
Open Access | Times Cited: 21

Large language models (LLMs): survey, technical frameworks, and future challenges
Pranjal Kumar
Artificial Intelligence Review (2024) Vol. 57, Iss. 10
Open Access | Times Cited: 20

Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang, Wei Li, Jun Han, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 19

EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Wei Zhang, Miaoxin Cai, Tong Zhang, et al.
IEEE Transactions on Geoscience and Remote Sensing (2024) Vol. 62, pp. 1-20
Open Access | Times Cited: 16

Generative AI-driven Semantic Communication Networks: Architecture, Technologies and Applications
Chengsi Liang, Hongyang Du, Yao Sun, et al.
IEEE Transactions on Cognitive Communications and Networking (2024) Vol. 11, Iss. 1, pp. 27-47
Open Access | Times Cited: 15

A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics
Kai He, Rui Mao, Qika Lin, et al.
Information Fusion (2025), pp. 102963-102963
Open Access | Times Cited: 3

Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training
Anthony Meng Huat Tiong, Junnan Li, Boyang Li, et al.
(2022), pp. 951-967
Open Access | Times Cited: 39

FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context
Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, et al.
Lecture notes in computer science (2022), pp. 253-270
Closed Access | Times Cited: 38

Cross on Cross Attention: Deep Fusion Transformer for Image Captioning
Jing Zhang, Yingshuai Xie, Weichao Ding, et al.
IEEE Transactions on Circuits and Systems for Video Technology (2023) Vol. 33, Iss. 8, pp. 4257-4268
Closed Access | Times Cited: 32

Verbs in Action: Improving verb understanding in video-language models
Liliane Momeni, Mathilde Caron, Arsha Nagrani, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Open Access | Times Cited: 25

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
Weidong Chen, Xiaofen Xing, Peihao Chen, et al.
IEEE Transactions on Affective Computing (2024) Vol. 15, Iss. 3, pp. 1711-1724
Open Access | Times Cited: 14

Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding
Guocheng Qian, Abdullah Hamdi, Xingdi Zhang, et al.
2021 International Conference on 3D Vision (3DV) (2024) Vol. 80, pp. 1280-1290
Open Access | Times Cited: 9

Is Attention all You Need in Medical Image Analysis? A Review
Giorgos Papanastasiou, Νικόλαος Δικαίος, Jiahao Huang, et al.
IEEE Journal of Biomedical and Health Informatics (2023) Vol. 28, Iss. 3, pp. 1398-1411
Open Access | Times Cited: 17

Text-Guided Foundation Model Adaptation for Pathological Image Classification
Yunkun Zhang, Jin Gao, Mu Zhou, et al.
Lecture notes in computer science (2023), pp. 272-282
Closed Access | Times Cited: 15

MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting
Oscar Mañas, Pau Rodriguez Lopez, Saba Ahmadi, et al.
(2023), pp. 2523-2548
Open Access | Times Cited: 14

Learning Combinatorial Prompts for Universal Controllable Image Captioning
Zhen Wang, Jun Xiao, Yueting Zhuang, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 5

SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models
Ziyi Lin, Dongyang Liu, Renrui Zhang, et al.
Lecture notes in computer science (2024), pp. 36-55
Closed Access | Times Cited: 5

MedBLIP: Bootstrapping Language-Image Pretraining from 3D Medical Images and Texts
Qiu-hui Chen, Yi Hong
Lecture notes in computer science (2024), pp. 98-113
Closed Access | Times Cited: 5

Automatic Radiology Report Generator Using Transformer With Contrast-Based Image Enhancement
Hilya Tsaniya, Chastine Fatichah, Nanik Suciati
IEEE Access (2024) Vol. 12, pp. 25429-25442
Open Access | Times Cited: 4

Page 1 - Next Page
