
OpenAlex is an openly accessible bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi, Hamid Reza Pourreza, Hamidreza Mahyar
ACM Computing Surveys (2023) Vol. 56, Iss. 3, pp. 1-39
Open Access | Times Cited: 67
Showing 1-25 of 67 citing articles:
Image captioning by diffusion models: A survey
Fatemeh Daneshfar, Ako Bartani, Pardis Lotfi
Engineering Applications of Artificial Intelligence (2024) Vol. 138, pp. 109288-109288
Closed Access | Times Cited: 11
Transforming dental diagnostics with artificial intelligence: advanced integration of ChatGPT and large language models for patient care
Masoumeh Farhadi Nia, Mohsen Ahmadi, Elyas Irankhah
Frontiers in Dental Medicine (2025) Vol. 5
Open Access | Times Cited: 1
An ensemble model with attention based mechanism for image captioning
Israa Al Badarneh, Bassam Hammo, Omar S. Al-Kadi
Computers & Electrical Engineering (2025) Vol. 123, pp. 110077-110077
Open Access | Times Cited: 1
Digital twin model for analyzing deformation and seepage in high earth-rock dams
Jichen Tian, Ruili Yu, Jiankang Chen, et al.
Automation in Construction (2025) Vol. 173, pp. 106079-106079
Closed Access | Times Cited: 1
DIC-Transformer: interpretation of plant disease classification results using image caption generation technology
Qingtian Zeng, Jian Sun, Shansong Wang
Frontiers in Plant Science (2024) Vol. 14
Open Access | Times Cited: 6
Psychological analysis of house-tree-person drawings based on multimodal large models
Dahong Xu, Siyu Jiang, Yihan Zhang, et al.
Multimedia Systems (2025) Vol. 31, Iss. 1
Closed Access
Decoding Images into Words: Comprehensive Study of Deep Learning in Image Captioning
Aleena Joji, Riccardo De Maria, Dilip Krishnan
SSRN Electronic Journal (2025)
Closed Access
Post-hoc XAI Method for Visual Question Answering (VQA)
Satya M. Muddamsetty, André Schmidt, Thomas B. Moeslund
Lecture notes in computer science (2025), pp. 369-382
Closed Access
A Holistic Review of Image-to-Text Conversion: Techniques, Evaluation Metrics, Multilingual Captioning, Storytelling and Integration
Anjali Sharma, Mayank Aggarwal
SN Computer Science (2025) Vol. 6, Iss. 3
Closed Access
A systematic literature review on incomplete multimodal learning: techniques and challenges
Yifan Zhan, Rui Yang, Jong-Bum You, et al.
Systems Science & Control Engineering (2025) Vol. 13, Iss. 1
Open Access
Pathology report generation from whole slide images with knowledge retrieval and multi-level regional feature selection
Dingyi Hu, Zhiguo Jiang, Jun Shi, et al.
Computer Methods and Programs in Biomedicine (2025) Vol. 263, pp. 108677-108677
Closed Access
Unveiling the Role of Memory in Shaping Visual Perception: Empirical Insights
Amrita Mukherjee, Avijit Paul, Ratan K. Saha, et al.
Lecture notes in computer science (2025), pp. 47-58
Closed Access
TA2V: Text-Audio Guided Video Generation
Minglu Zhao, Wenmin Wang, Tongbao Chen, et al.
IEEE Transactions on Multimedia (2024) Vol. 26, pp. 7250-7264
Closed Access | Times Cited: 4
Domain knowledge-driven image captioning for bridge damage description generation
CL Chai, Yan Gao, Guanyu Xiong, et al.
Automation in Construction (2025) Vol. 174, pp. 106116-106116
Closed Access
Image captioning based on scene graphs: A survey
Junhua Jia, Xiangqian Ding, Shunpeng Pang, et al.
Expert Systems with Applications (2023) Vol. 231, pp. 120698-120698
Closed Access | Times Cited: 10
Beyond images: an integrative multi-modal approach to chest x-ray report generation
Nurbanu Aksoy, Serge Sharoff, Selçuk Başer, et al.
Frontiers in Radiology (2024) Vol. 4
Open Access | Times Cited: 3
A comprehensive literature review on image captioning methods and metrics based on deep learning technique
Ahmad Sami Al-Shamayleh, Omar Adwan, Mohammad A. Alsharaiah, et al.
Multimedia Tools and Applications (2024) Vol. 83, Iss. 12, pp. 34219-34268
Closed Access | Times Cited: 3
Attention-based image captioning for structural health assessment of apartment buildings
Nguyen Ngoc Han Dinh, Hyunkyu Shin, Yonghan Ahn, et al.
Automation in Construction (2024) Vol. 167, pp. 105677-105677
Closed Access | Times Cited: 3
A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages
Ashwaq Alsayed, Muhammad Arif, Thamir M. Qadah, et al.
Applied Sciences (2023) Vol. 13, Iss. 19, pp. 10894-10894
Open Access | Times Cited: 7
Images, Words, and Imagination: Accessible Descriptions to Support Blind and Low Vision Art Exploration and Engagement
Stacy A. Doore, David Istrati, Chenchang Xu, et al.
Journal of Imaging (2024) Vol. 10, Iss. 1, pp. 26-26
Open Access | Times Cited: 2
A survey on advancements in image–text multimodal models: From general techniques to biomedical implementations
Ruifeng Guo, Jingxuan Wei, Linzhuang Sun, et al.
Computers in Biology and Medicine (2024) Vol. 178, pp. 108709-108709
Closed Access | Times Cited: 2
CheXReport: A transformer-based architecture to generate chest X-ray reports suggestions
Felipe André Zeiser, Cristiano André da Costa, Gabriel de Oliveira Ramos, et al.
Expert Systems with Applications (2024) Vol. 255, pp. 124644-124644
Closed Access | Times Cited: 2
A survey of multimodal federated learning: background, applications, and perspectives
Hao Pan, Xiaoli Zhao, Lipeng He, et al.
Multimedia Systems (2024) Vol. 30, Iss. 4
Closed Access | Times Cited: 2
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu, Xiaojun Lin, Shuhui Wang, et al.
IEEE Transactions on Circuits and Systems for Video Technology (2023) Vol. 34, Iss. 3, pp. 1322-1338
Open Access | Times Cited: 5
Arabic Image Captioning: The Effect of Text Pre-processing on the Attention Weights and the BLEU-N Scores
Moaz T. Lasheen, Nahla Barakat
International Journal of Advanced Computer Science and Applications (2022) Vol. 13, Iss. 7
Open Access | Times Cited: 8