OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

From Images to Textual Prompts: Zero-shot Visual Question Answering with Frozen Large Language Models
Jiaxian Guo, Junnan Li, Dongxu Li, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10867-10877
Closed Access | Times Cited: 59

Showing 26-50 of 59 citing articles:

LLM-based framework for bearing fault diagnosis
Laifa Tao, Haifei Liu, Guoao Ning, et al.
Mechanical Systems and Signal Processing (2024) Vol. 224, pp. 112127-112127
Closed Access | Times Cited: 2

Consolidating Trees of Robotic Plans Generated Using Large Language Models to Improve Reliability
Md Sadman Sakib, Yu Sun
Deleted Journal (2024) Vol. 01, Iss. 01
Closed Access | Times Cited: 1

Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man, Liang-Yan Gui, Yu-Xiong Wang
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 13678-13688
Closed Access | Times Cited: 1

BRAVE: Broadening the Visual Encoding of Vision-Language Models
Oğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, et al.
Lecture notes in computer science (2024), pp. 113-132
Closed Access | Times Cited: 1

Answer-Based Entity Extraction and Alignment for Visual Text Question Answering
Jun Yu, Mohan Jing, W Liu, et al.
(2023), pp. 9487-9491
Closed Access | Times Cited: 3

Application Research of Large Language Models in Medicine: Status, Problems, and Future
Guangcheng Ao, Guangyi Wang, Yong Chen, et al.
(2023), pp. 734-739
Closed Access | Times Cited: 2

Enhancing Multimodal Understanding With LIUS
Chunlai Song
Journal of Organizational and End User Computing (2024) Vol. 36, Iss. 1, pp. 1-17
Open Access

Enhancing machine vision: the impact of a novel innovative technology on video question-answering
Songjian Dan, Wei Feng
Soft Computing (2024) Vol. 28, Iss. 11-12, pp. 6969-6982
Closed Access

Red Teaming for Multimodal Large Language Models: A Survey
Moushumi Mahato, Avinash Kumar, Kartikey Singh, et al.
(2024)
Open Access

Prompting Large Language Models with Fine-Grained Visual Relations from Scene Graph for Visual Question Answering
Jiapeng Liu, Chengyang Fang, Liang Li, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 8125-8129
Closed Access

Llm Knowledge-Driven Target Prototype Learning for Few-Shot Segmentation
Pengfang Li, Fang Liu, Licheng Jiao, et al.
(2024)
Closed Access

Enhanced Qwen-VL 7B Model via Instruction Finetuning on Chinese Medical Dataset
Jianping Luo, Hanyi Yu, Cong Tan, et al.
(2024), pp. 526-530
Closed Access

HKFNet: Fine-Grained External Knowledge Fusion for Fact-Based Visual Question Answering
Bojin Li, Yan Sun, Xue Chen, et al.
2022 International Joint Conference on Neural Networks (IJCNN) (2024) Vol. 35, pp. 1-8
Closed Access

Zero-shot Video-based Visual Question Answering for Visually Impaired People
Ratnabali Pal, Samarjit Kar, Arif Ahmed Sekh
Research Square (Research Square) (2024)
Closed Access

Emergent Open-Vocabulary Semantic Segmentation from Off-the-Shelf Vision-Language Models
Jiayun Luo, Siddhesh Khandelwal, Leonid Sigal, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 32, pp. 4029-4040
Closed Access

Integrating IoT and visual question answering in smart cities: Enhancing educational outcomes
Tian Gao, G. Gary Wang
Alexandria Engineering Journal (2024) Vol. 108, pp. 878-888
Closed Access

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge
Haibo Wang, Weifeng Ge
Lecture notes in computer science (2024), pp. 274-292
Closed Access

DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation
Rakshith Subramanyam, Kowshik Thopalli, Vivek Narayanaswamy, et al.
Lecture notes in computer science (2024), pp. 465-482
Closed Access

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models
Hao Cheng, Erjia Xiao, Jindong Gu, et al.
Lecture notes in computer science (2024), pp. 179-196
Closed Access

LLM-Sentry: A Model-Agnostic Human-in-the-Loop Framework for Securing Large Language Models
Saquib Irtiza, Khandakar Ashrafi Akbar, Arowa Yasmeen, et al.
(2024), pp. 245-254
Closed Access

Image-Based Criticality-Aware Fire Detection
Alina Arshad, Jawwad Ahmed Shamsi, Muhammad Burhan Khan, et al.
(2024), pp. 1-6
Closed Access

RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering
Yuduo Wang, Pedram Ghamisi
IEEE Transactions on Geoscience and Remote Sensing (2024) Vol. 62, pp. 1-13
Open Access

Beyond Segmentation: Road Network Generation with Multi-modal LLMs
Sumedh Rasal, Sanjay K. Boddhu
Lecture notes in networks and systems (2024), pp. 308-315
Closed Access

Situational Data Integration in Question Answering systems: a survey over two decades
María Helena Franciscatto, Luis C. E. Bona, Célio Trois, et al.
Knowledge and Information Systems (2024) Vol. 66, Iss. 10, pp. 5875-5918
Closed Access

Scroll to top