OpenAlex Citation Counts

OpenAlex is an open-access bibliographic catalogue of scientific papers, authors, and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!

Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this same listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.

Requested Article:

FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Noam Rotstein, David Bensaïd, Shaked Brody, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 5677-5688
Open Access | Times Cited: 15

Showing 15 citing articles:

SCAP: enhancing image captioning through lightweight feature sifting and hierarchical decoding
Yuhao Zhang, Jiaqi Tong, Honglin Liu
The Visual Computer (2025)
Closed Access

Large language models (LLM) in computational social science: prospects, current state, and challenges
Surendrabikram Thapa, Shuvam Shiwakoti, Siddhant Bikram Shah, et al.
Social Network Analysis and Mining (2025) Vol. 15, Iss. 1
Open Access

AD2AT: Audio Description to Alternative Text, a Dataset of Alternative Text from Movies
Elise Lincker, Camille Guinaudeau, Shin’ichi Satoh
Lecture notes in computer science (2025), pp. 58-71
Closed Access

Knowledge guided relation enhancement for human-object interaction detection
Rui Su, Yongbin Gao, Wenjun Yu, et al.
Applied Intelligence (2025) Vol. 55, Iss. 6
Closed Access

CapsFusion: Rethinking Image-Text Data at Scale
Qiying Yu, Quan Sun, Xiaosong Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14022-14032
Closed Access | Times Cited: 3

A Picture May Be Worth a Hundred Words for Visual Question Answering
Yusuke Hirota, Noa García, Mayu Otani, et al.
Electronics (2024) Vol. 13, Iss. 21, pp. 4290-4290
Open Access | Times Cited: 2

Control With Style: Style Embedding-based Variational Autoencoder for Controlled Stylized Caption Generation Framework
Dhruv Sharma, Chhavi Dhiman, Dinesh Kumar
IEEE Transactions on Cognitive and Developmental Systems (2024) Vol. 16, Iss. 6, pp. 2032-2042
Closed Access | Times Cited: 2

Evaluating the Fidelity of Image Captioning via Weighted Boolean Question Answering
Kaixuan Wang, Shasha Li, Jintao Tang, et al.
Lecture notes in computer science (2024), pp. 356-368
Closed Access

DanceCaps: Pseudo-Captioning for Dance Videos Using Large Language Models
Seohyun Kim, Kyogu Lee
Applied Sciences (2024) Vol. 14, Iss. 22, pp. 10116-10116
Open Access

A New Multimodal Large Model Framework for Knowledge-enhanced Image Caption Generation
Tengfei Wan, Huiyi Liu, Lijie Geng
(2024), pp. 571-575
Closed Access

Multi-Modal Inductive Framework for Text-Video Retrieval
Qian Li, Yucheng Zhou, Cheng Ji, et al.
(2024), pp. 2389-2398
Closed Access

M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions
Mingsheng Li, Xin Chen, Chi Zhang, et al.
Lecture notes in computer science (2024), pp. 41-59
Closed Access

Large Models in Dialogue for Active Perception and Anomaly Detection
Tzoulio Chamiti, Nikolaos Passalis, Anastasios Tefas
Lecture notes in computer science (2024), pp. 371-386
Closed Access

MCANet: Multimodal Caption Aware Training-Free Video Anomaly Detection via Large Language Model
Prabhu Prasad Dev, Raju Hazari, Pranesh Das
Lecture notes in computer science (2024), pp. 362-379
Closed Access

Page 1
