OpenAlex Citation Counts

OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.

Requested Article:

Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi, Hamid Reza Pourreza, Hamidreza Mahyar
ACM Computing Surveys (2023) Vol. 56, Iss. 3, pp. 1-39
Open Access | Times Cited: 67

Showing 1-25 of 67 citing articles:

Image captioning by diffusion models: A survey
Fatemeh Daneshfar, Ako Bartani, Pardis Lotfi
Engineering Applications of Artificial Intelligence (2024) Vol. 138, pp. 109288-109288
Closed Access | Times Cited: 11

Transforming dental diagnostics with artificial intelligence: advanced integration of ChatGPT and large language models for patient care
Masoumeh Farhadi Nia, Mohsen Ahmadi, Elyas Irankhah
Frontiers in Dental Medicine (2025) Vol. 5
Open Access | Times Cited: 1

An ensemble model with attention based mechanism for image captioning
Israa Al Badarneh, Bassam Hammo, Omar S. Al-Kadi
Computers & Electrical Engineering (2025) Vol. 123, pp. 110077-110077
Open Access | Times Cited: 1

Digital twin model for analyzing deformation and seepage in high earth-rock dams
Jichen Tian, Ruili Yu, Jiankang Chen, et al.
Automation in Construction (2025) Vol. 173, pp. 106079-106079
Closed Access | Times Cited: 1

DIC-Transformer: interpretation of plant disease classification results using image caption generation technology
Qingtian Zeng, Jian Sun, Shansong Wang
Frontiers in Plant Science (2024) Vol. 14
Open Access | Times Cited: 6

Psychological analysis of house-tree-person drawings based on multimodal large models
Dahong Xu, Siyu Jiang, Yihan Zhang, et al.
Multimedia Systems (2025) Vol. 31, Iss. 1
Closed Access

Decoding Images into Words: Comprehensive Study of Deep Learning in Image Captioning
Aleena Joji, Riccardo De Maria, Dilip Krishnan
SSRN Electronic Journal (2025)
Closed Access

Post-hoc XAI Method for Visual Question Answering (VQA)
Satya M. Muddamsetty, André Schmidt, Thomas B. Moeslund
Lecture notes in computer science (2025), pp. 369-382
Closed Access

A Holistic Review of Image-to-Text Conversion: Techniques, Evaluation Metrics, Multilingual Captioning, Storytelling and Integration
Anjali Sharma, Mayank Aggarwal
SN Computer Science (2025) Vol. 6, Iss. 3
Closed Access

A systematic literature review on incomplete multimodal learning: techniques and challenges
Yifan Zhan, Rui Yang, Jong-Bum You, et al.
Systems Science & Control Engineering (2025) Vol. 13, Iss. 1
Open Access

Pathology report generation from whole slide images with knowledge retrieval and multi-level regional feature selection
Dingyi Hu, Zhiguo Jiang, Jun Shi, et al.
Computer Methods and Programs in Biomedicine (2025) Vol. 263, pp. 108677-108677
Closed Access

Unveiling the Role of Memory in Shaping Visual Perception: Empirical Insights
Amrita Mukherjee, Avijit Paul, Ratan K. Saha, et al.
Lecture notes in computer science (2025), pp. 47-58
Closed Access

TA2V: Text-Audio Guided Video Generation
Minglu Zhao, Wenmin Wang, Tongbao Chen, et al.
IEEE Transactions on Multimedia (2024) Vol. 26, pp. 7250-7264
Closed Access | Times Cited: 4

Domain knowledge-driven image captioning for bridge damage description generation
CL Chai, Yan Gao, Guanyu Xiong, et al.
Automation in Construction (2025) Vol. 174, pp. 106116-106116
Closed Access

Image captioning based on scene graphs: A survey
Junhua Jia, Xiangqian Ding, Shunpeng Pang, et al.
Expert Systems with Applications (2023) Vol. 231, pp. 120698-120698
Closed Access | Times Cited: 10

Beyond images: an integrative multi-modal approach to chest x-ray report generation
Nurbanu Aksoy, Serge Sharoff, Selçuk Başer, et al.
Frontiers in Radiology (2024) Vol. 4
Open Access | Times Cited: 3

A comprehensive literature review on image captioning methods and metrics based on deep learning technique
Ahmad Sami Al-Shamayleh, Omar Adwan, Mohammad A. Alsharaiah, et al.
Multimedia Tools and Applications (2024) Vol. 83, Iss. 12, pp. 34219-34268
Closed Access | Times Cited: 3

Attention-based image captioning for structural health assessment of apartment buildings
Nguyen Ngoc Han Dinh, Hyunkyu Shin, Yonghan Ahn, et al.
Automation in Construction (2024) Vol. 167, pp. 105677-105677
Closed Access | Times Cited: 3

A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages
Ashwaq Alsayed, Muhammad Arif, Thamir M. Qadah, et al.
Applied Sciences (2023) Vol. 13, Iss. 19, pp. 10894-10894
Open Access | Times Cited: 7

Images, Words, and Imagination: Accessible Descriptions to Support Blind and Low Vision Art Exploration and Engagement
Stacy A. Doore, David Istrati, Chenchang Xu, et al.
Journal of Imaging (2024) Vol. 10, Iss. 1, pp. 26-26
Open Access | Times Cited: 2

A survey on advancements in image–text multimodal models: From general techniques to biomedical implementations
Ruifeng Guo, Jingxuan Wei, Linzhuang Sun, et al.
Computers in Biology and Medicine (2024) Vol. 178, pp. 108709-108709
Closed Access | Times Cited: 2

CheXReport: A transformer-based architecture to generate chest X-ray reports suggestions
Felipe André Zeiser, Cristiano André da Costa, Gabriel de Oliveira Ramos, et al.
Expert Systems with Applications (2024) Vol. 255, pp. 124644-124644
Closed Access | Times Cited: 2

A survey of multimodal federated learning: background, applications, and perspectives
Hao Pan, Xiaoli Zhao, Lipeng He, et al.
Multimedia Systems (2024) Vol. 30, Iss. 4
Closed Access | Times Cited: 2

A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu, Xiaojun Lin, Shuhui Wang, et al.
IEEE Transactions on Circuits and Systems for Video Technology (2023) Vol. 34, Iss. 3, pp. 1322-1338
Open Access | Times Cited: 5

Arabic Image Captioning: The Effect of Text Pre-processing on the Attention Weights and the BLEU-N Scores
Moaz T. Lasheen, Nahla Barakat
International Journal of Advanced Computer Science and Applications (2022) Vol. 13, Iss. 7
Open Access | Times Cited: 8

Page 1 - Next Page
