
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein, David Bensaïd, Shaked Brody, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 5677-5688
Open Access | Times Cited: 15
Noam Rotstein, David Bensaïd, Shaked Brody, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 5677-5688
Open Access | Times Cited: 15
Showing 15 citing articles:
SCAP: enhancing image captioning through lightweight feature sifting and hierarchical decoding
Yuhao Zhang, Jiaqi Tong, Honglin Liu
The Visual Computer (2025)
Closed Access
Yuhao Zhang, Jiaqi Tong, Honglin Liu
The Visual Computer (2025)
Closed Access
Large language models (LLM) in computational social science: prospects, current state, and challenges
Surendrabikram Thapa, Shuvam Shiwakoti, Siddhant Bikram Shah, et al.
Social Network Analysis and Mining (2025) Vol. 15, Iss. 1
Open Access
Surendrabikram Thapa, Shuvam Shiwakoti, Siddhant Bikram Shah, et al.
Social Network Analysis and Mining (2025) Vol. 15, Iss. 1
Open Access
AD2AT: Audio Description to Alternative Text, a Dataset of Alternative Text from Movies
Elise Lincker, Camille Guinaudeau, Shin’ichi Satoh
Lecture notes in computer science (2025), pp. 58-71
Closed Access
Elise Lincker, Camille Guinaudeau, Shin’ichi Satoh
Lecture notes in computer science (2025), pp. 58-71
Closed Access
Knowledge guided relation enhancement for human-object interaction detection
Rui Su, Yongbin Gao, Wenjun Yu, et al.
Applied Intelligence (2025) Vol. 55, Iss. 6
Closed Access
Rui Su, Yongbin Gao, Wenjun Yu, et al.
Applied Intelligence (2025) Vol. 55, Iss. 6
Closed Access
CapsFusion: Rethinking Image-Text Data at Scale
Qiying Yu, Quan Sun, Xiaosong Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14022-14032
Closed Access | Times Cited: 3
Qiying Yu, Quan Sun, Xiaosong Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14022-14032
Closed Access | Times Cited: 3
A Picture May Be Worth a Hundred Words for Visual Question Answering
Yusuke Hirota, Noa García, Mayu Otani, et al.
Electronics (2024) Vol. 13, Iss. 21, pp. 4290-4290
Open Access | Times Cited: 2
Yusuke Hirota, Noa García, Mayu Otani, et al.
Electronics (2024) Vol. 13, Iss. 21, pp. 4290-4290
Open Access | Times Cited: 2
Control With Style: Style Embedding-based Variational Autoencoder for Controlled Stylized Caption Generation Framework
Dhruv Sharma, Chhavi Dhiman, Dinesh Kumar
IEEE Transactions on Cognitive and Developmental Systems (2024) Vol. 16, Iss. 6, pp. 2032-2042
Closed Access | Times Cited: 2
Dhruv Sharma, Chhavi Dhiman, Dinesh Kumar
IEEE Transactions on Cognitive and Developmental Systems (2024) Vol. 16, Iss. 6, pp. 2032-2042
Closed Access | Times Cited: 2
Evaluating the Fidelity of Image Captioning via Weighted Boolean Question Answering
Kaixuan Wang, Shasha Li, Jintao Tang, et al.
Lecture notes in computer science (2024), pp. 356-368
Closed Access
Kaixuan Wang, Shasha Li, Jintao Tang, et al.
Lecture notes in computer science (2024), pp. 356-368
Closed Access
DanceCaps: Pseudo-Captioning for Dance Videos Using Large Language Models
Seohyun Kim, Kyogu Lee
Applied Sciences (2024) Vol. 14, Iss. 22, pp. 10116-10116
Open Access
Seohyun Kim, Kyogu Lee
Applied Sciences (2024) Vol. 14, Iss. 22, pp. 10116-10116
Open Access
DTC: Difference-aware Transformer with CLIP Adaptation for Change Captioning
W.-Y. Liu
(2024), pp. 79-86
Closed Access
W.-Y. Liu
(2024), pp. 79-86
Closed Access
A New Multimodal Large Model Framework for Knowledge-enhanced Image Caption Generation
Tengfei Wan, Huiyi Liu, Lijie Geng
(2024), pp. 571-575
Closed Access
Tengfei Wan, Huiyi Liu, Lijie Geng
(2024), pp. 571-575
Closed Access
Multi-Modal Inductive Framework for Text-Video Retrieval
Qian Li, Yucheng Zhou, Cheng Ji, et al.
(2024), pp. 2389-2398
Closed Access
Qian Li, Yucheng Zhou, Cheng Ji, et al.
(2024), pp. 2389-2398
Closed Access
M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions
Mingsheng Li, Xin Chen, Chi Zhang, et al.
Lecture notes in computer science (2024), pp. 41-59
Closed Access
Mingsheng Li, Xin Chen, Chi Zhang, et al.
Lecture notes in computer science (2024), pp. 41-59
Closed Access
Large Models in Dialogue for Active Perception and Anomaly Detection
Tzoulio Chamiti, Nikolaos Passalis, Anastasios Tefas
Lecture notes in computer science (2024), pp. 371-386
Closed Access
Tzoulio Chamiti, Nikolaos Passalis, Anastasios Tefas
Lecture notes in computer science (2024), pp. 371-386
Closed Access
MCANet: Multimodal Caption Aware Training-Free Video Anomaly Detection via Large Language Model
Prabhu Prasad Dev, Raju Hazari, Pranesh Das
Lecture notes in computer science (2024), pp. 362-379
Closed Access
Prabhu Prasad Dev, Raju Hazari, Pranesh Das
Lecture notes in computer science (2024), pp. 362-379
Closed Access