
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing
Zequn Zeng, Hao Zhang, Ruiying Lu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23465-23476
Open Access | Times Cited: 27
Zequn Zeng, Hao Zhang, Ruiying Lu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 23465-23476
Open Access | Times Cited: 27
Showing 1-25 of 27 citing articles:
MeaCap: Memory-Augmented Zero-shot Image Captioning
Zequn Zeng, Yan Xie, Hao Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14100-14110
Closed Access | Times Cited: 7
Zequn Zeng, Yan Xie, Hao Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14100-14110
Closed Access | Times Cited: 7
UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web
Y. H. Yan, Haomin Wen, Siru Zhong, et al.
Proceedings of the ACM Web Conference 2022 (2024), pp. 4006-4017
Open Access | Times Cited: 5
Y. H. Yan, Haomin Wen, Siru Zhong, et al.
Proceedings of the ACM Web Conference 2022 (2024), pp. 4006-4017
Open Access | Times Cited: 5
Attribute-Based Learning for Remote Sensing Image Captioning in Unseen Scenes
Zhang Guo, Haomin Liu, Zihao Ren, et al.
Remote Sensing (2025) Vol. 17, Iss. 7, pp. 1237-1237
Open Access
Zhang Guo, Haomin Liu, Zihao Ren, et al.
Remote Sensing (2025) Vol. 17, Iss. 7, pp. 1237-1237
Open Access
Plant Disease Phenotype Captioning Via Zero-Shot Learning with Semantic Correction Based on Llm
Yushan Xie, Xinyu Dong, Kejun Zhao, et al.
(2025)
Closed Access
Yushan Xie, Xinyu Dong, Kejun Zhao, et al.
(2025)
Closed Access
Sentiment Caption Generation from Visual Scene Using Pre-trained Language Model
Xiaochen Zhang, Jin Li, Mengfan Xu, et al.
Lecture notes in computer science (2025), pp. 187-201
Closed Access
Xiaochen Zhang, Jin Li, Mengfan Xu, et al.
Lecture notes in computer science (2025), pp. 187-201
Closed Access
Cross-modal Augmented Transformer for Automated Medical Report Generation
Yuhao Tang, Ye Yuan, Fei Tao, et al.
IEEE Journal of Translational Engineering in Health and Medicine (2025) Vol. 13, pp. 33-48
Open Access
Yuhao Tang, Ye Yuan, Fei Tao, et al.
IEEE Journal of Translational Engineering in Health and Medicine (2025) Vol. 13, pp. 33-48
Open Access
VTIENet: visual-text information enhancement network for image captioning
Juan Yang, Yuhang Wei, Ronggui Wang, et al.
Multimedia Systems (2025) Vol. 31, Iss. 1
Closed Access
Juan Yang, Yuhang Wei, Ronggui Wang, et al.
Multimedia Systems (2025) Vol. 31, Iss. 1
Closed Access
Scene graph sorting and shuffle polishing based controllable image captioning
Guichang Wu, Qian Zhao, Xiushu Liu
Signal Image and Video Processing (2025) Vol. 19, Iss. 4
Closed Access
Guichang Wu, Qian Zhao, Xiushu Liu
Signal Image and Video Processing (2025) Vol. 19, Iss. 4
Closed Access
CDZL: a controllable diversity zero-shot image caption model using large language models
Xin Zhao, Weiwei Kong, Zongyao Liu, et al.
Signal Image and Video Processing (2025) Vol. 19, Iss. 4
Closed Access
Xin Zhao, Weiwei Kong, Zongyao Liu, et al.
Signal Image and Video Processing (2025) Vol. 19, Iss. 4
Closed Access
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu, Xiaojun Lin, Shuhui Wang, et al.
IEEE Transactions on Circuits and Systems for Video Technology (2023) Vol. 34, Iss. 3, pp. 1322-1338
Open Access | Times Cited: 5
Ting Yu, Xiaojun Lin, Shuhui Wang, et al.
IEEE Transactions on Circuits and Systems for Video Technology (2023) Vol. 34, Iss. 3, pp. 1322-1338
Open Access | Times Cited: 5
EntroCap: Zero-shot image captioning with entropy-based retrieval
Jie Yan, Yuxiang Xie, Shiwei Zou, et al.
Neurocomputing (2024), pp. 128666-128666
Closed Access | Times Cited: 1
Jie Yan, Yuxiang Xie, Shiwei Zou, et al.
Neurocomputing (2024), pp. 128666-128666
Closed Access | Times Cited: 1
LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting
Rita Ramos, Bruno Martins, Desmond Elliott
Findings of the Association for Computational Linguistics: ACL 2022 (2023), pp. 1635-1651
Open Access | Times Cited: 3
Rita Ramos, Bruno Martins, Desmond Elliott
Findings of the Association for Computational Linguistics: ACL 2022 (2023), pp. 1635-1651
Open Access | Times Cited: 3
Zero-TextCap: Zero-shot Framework for Text-based Image Captioning
Dongsheng Xu, Wenye Zhao, Yi Cai, et al.
(2023), pp. 4949-4957
Closed Access | Times Cited: 3
Dongsheng Xu, Wenye Zhao, Yi Cai, et al.
(2023), pp. 4949-4957
Closed Access | Times Cited: 3
ZeroGen: Zero-Shot Multimodal Controllable Text Generation with Multiple Oracles
Haoqin Tu, Bowen Yang, Xianfeng Zhao
Lecture notes in computer science (2023), pp. 494-506
Closed Access | Times Cited: 3
Haoqin Tu, Bowen Yang, Xianfeng Zhao
Lecture notes in computer science (2023), pp. 494-506
Closed Access | Times Cited: 3
ControlCap: Controllable Captioning via No-Fuss Lexicon
Qiujie Xie, Qiming Feng, Yuejie Zhang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 8326-8330
Closed Access
Qiujie Xie, Qiming Feng, Yuejie Zhang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 8326-8330
Closed Access
SamCap: Energy-based Controllable Image Captioning by Gradient-Based Sampling
Yuchen Niu, Min Zhu, Zhihua Wei
(2024) Vol. 35, pp. 608-617
Open Access
Yuchen Niu, Min Zhu, Zhihua Wei
(2024) Vol. 35, pp. 608-617
Open Access
Exploring coherence from heterogeneous representations for OCR image captioning
Yao Zhang, Zijie Song, Zhenzhen Hu
Multimedia Systems (2024) Vol. 30, Iss. 5
Closed Access
Yao Zhang, Zijie Song, Zhenzhen Hu
Multimedia Systems (2024) Vol. 30, Iss. 5
Closed Access
HICEScore: A Hierarchical Metric for Image Captioning Evaluation
Zequn Zeng, Jianqiao Sun, Hao Zhang, et al.
(2024), pp. 866-875
Open Access
Zequn Zeng, Jianqiao Sun, Hao Zhang, et al.
(2024), pp. 866-875
Open Access
Controllable Contextualized Image Captioning: Directing the Visual Narrative Through User-Defined Highlights
Shunqi Mao, Chaoyi Zhang, Hang Su, et al.
Lecture notes in computer science (2024), pp. 464-481
Closed Access
Shunqi Mao, Chaoyi Zhang, Hang Su, et al.
Lecture notes in computer science (2024), pp. 464-481
Closed Access
Improving Zero-Shot Image Captioning Efficiency with Metropolis-Hastings
D. Du, Yujia Wu
Lecture notes in computer science (2024), pp. 305-318
Closed Access
D. Du, Yujia Wu
Lecture notes in computer science (2024), pp. 305-318
Closed Access
ICLB: Target‑Oriented Multimodal Sentiment Classification by Using Image Caption and Topic Model
Ziwei Chen, Fupeng Wei, Qiusheng Zheng, et al.
Mechanisms and machine science (2024), pp. 150-167
Closed Access
Ziwei Chen, Fupeng Wei, Qiusheng Zheng, et al.
Mechanisms and machine science (2024), pp. 150-167
Closed Access
Learning dual updatable memory modules for video anomaly detection
Liang Zhang, Shifeng Li, Cheng Yan, et al.
Multimedia Systems (2024) Vol. 31, Iss. 1
Closed Access
Liang Zhang, Shifeng Li, Cheng Yan, et al.
Multimedia Systems (2024) Vol. 31, Iss. 1
Closed Access
Group-Based Distinctive Image Captioning with Memory Difference Encoding and Attention
Jiuniu Wang, Wenjia Xu, Qingzhong Wang, et al.
International Journal of Computer Vision (2024)
Open Access
Jiuniu Wang, Wenjia Xu, Qingzhong Wang, et al.
International Journal of Computer Vision (2024)
Open Access
CIC-BART-SSA: Controllable Image Captioning with Structured Semantic Augmentation
Kalliopi Basioti, Mohamed Abbas Abdelsalam, Federico Fancellu, et al.
Lecture notes in computer science (2024), pp. 444-461
Closed Access
Kalliopi Basioti, Mohamed Abbas Abdelsalam, Federico Fancellu, et al.
Lecture notes in computer science (2024), pp. 444-461
Closed Access
FocusCap: Object-Focused Image Captioning with CLIP-Guided Language Model
Zihan Kong, Wei Li, Haiwei Zhang, et al.
Lecture notes in computer science (2023), pp. 319-330
Closed Access | Times Cited: 1
Zihan Kong, Wei Li, Haiwei Zhang, et al.
Lecture notes in computer science (2023), pp. 319-330
Closed Access | Times Cited: 1