OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang, Wei Li, Jun Han, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 18

Showing 18 citing articles:

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu, Christopher M. Clark, Sang-Ho Lee, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 7, pp. 26429-26445
Closed Access | Times Cited: 7

V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs
Penghao Wu, Saining Xie
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 13084-13094
Closed Access | Times Cited: 6

LTGC: Long-Tail Recognition via Leveraging LLMs-Driven Generated Content
Qihao Zhao, Yalun Dai, Hao Li, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. abs/1707.06642, pp. 19510-19520
Closed Access | Times Cited: 4

Let storytelling tell vivid stories: A multi-modal-agent-based unified storytelling framework
Rongsheng Zhang, Jiji Tang, Chuanqi Zang, et al.
Neurocomputing (2025) Vol. 622, pp. 129316-129316
Closed Access

Focusing on feature-level domain alignment with text semantic for weakly-supervised domain adaptive object detection
Zichong Chen, Jian Cheng, Ziying Xia, et al.
Neurocomputing (2025), pp. 129435-129435
Closed Access

YOLOFLY: A Consumer-Centric Framework for Efficient Object Detection in UAV Imagery
Pengwei Ma, Hongmei Fei, Dingyi Jia, et al.
Electronics (2025) Vol. 14, Iss. 3, pp. 498-498
Open Access

MammoVLM: A generative large vision-language model for mammography-related diagnostic assistance
Zhenjie Cao, Zhuo Deng, Jie Ma, et al.
Information Fusion (2025), pp. 102998-102998
Closed Access

Enhancing Cryptocurrency Security: Leveraging Embeddings and Large Language Models for Creating Cryptocurrency Security Expert Systems
Ahmed Mohamed Abdallah, Heba K. Aslan, Mohamed S. Abdallah, et al.
Symmetry (2025) Vol. 17, Iss. 4, pp. 496-496
Open Access

Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Jianzong Wu, Xiangtai Li, Chenyang Si, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 12501-12511
Closed Access | Times Cited: 3

Groundhog Grounding Large Language Models to Holistic Segmentation
Yichi Zhang, Ziqiao Ma, Xiaofeng Gao, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14227-14238
Closed Access | Times Cited: 3

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection
Ting Lei, Shaofeng Yin, Yang Liu
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 16657-16667
Closed Access | Times Cited: 3

Auto-delineation of treatment target volume for radiation therapy using large language model-aided multimodal learning
Praveenbalaji Rajendran, Yizheng Chen, Liang Qiu, et al.
International Journal of Radiation Oncology*Biology*Physics (2024)
Closed Access | Times Cited: 2

Image processing and artificial intelligence for apple detection and localization: A comprehensive review
Afshin Azizi, Zhao Zhang, Wanjia Hua, et al.
Computer Science Review (2024) Vol. 54, pp. 100690-100690
Closed Access | Times Cited: 2

Model-as-a-Service (MaaS): A Survey
Wensheng Gan, Shicheng Wan, Philip S. Yu
2021 IEEE International Conference on Big Data (Big Data) (2023)
Open Access | Times Cited: 6

Enhancing Object Detection by Leveraging Large Language Models for Contextual Knowledge
Amirreza Rouhi, Diego PatiƱo, David K. Han
Lecture notes in computer science (2024), pp. 299-314
Closed Access

Towards Context-Rich Automated Biodiversity Assessments: Deriving AI-Powered Insights from Camera Trap Data
Paul Fergus, Carl Chalmers, Naomi Matthews, et al.
Sensors (2024) Vol. 24, Iss. 24, pp. 8122-8122
Open Access

Large language models enabled intelligent microstructure optimization and defects classification of welded titanium alloys
Suyang Zhang, William Yi Wang, Xinzhao Wang, et al.
Journal of Materials Informatics (2024)
Open Access

In-vitro Blood Purification using Tiny Pinch Holographic Optical Tweezers Based on Deep Learning
Xiao Luo, Yu Ching Wong, Xiangyu Chen, et al.
Biosensors and Bioelectronics (2024) Vol. 267, pp. 116781-116781
Closed Access

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM
Yixuan Wu, Yizhou Wang, Shixiang Tang, et al.
Lecture notes in computer science (2024), pp. 164-182
Closed Access

Page 1

Scroll to top