
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Pengfei Li, Gang Liu, Jinlong He, et al.
Lecture notes in computer science (2023), pp. 374-383
Closed Access | Times Cited: 14
Pengfei Li, Gang Liu, Jinlong He, et al.
Lecture notes in computer science (2023), pp. 374-383
Closed Access | Times Cited: 14
Showing 14 citing articles:
Vision-language models for medical report generation and visual question answering: a review
Iryna Hartsock, Ghulam Rasool
Frontiers in Artificial Intelligence (2024) Vol. 7
Open Access | Times Cited: 15
Iryna Hartsock, Ghulam Rasool
Frontiers in Artificial Intelligence (2024) Vol. 7
Open Access | Times Cited: 15
Advancing medical imaging with language models: featuring a spotlight on ChatGPT
Mingzhe Hu, Joshua Qian, Shaoyan Pan, et al.
Physics in Medicine and Biology (2024) Vol. 69, Iss. 10, pp. 10TR01-10TR01
Open Access | Times Cited: 12
Mingzhe Hu, Joshua Qian, Shaoyan Pan, et al.
Physics in Medicine and Biology (2024) Vol. 69, Iss. 10, pp. 10TR01-10TR01
Open Access | Times Cited: 12
A Survey on Multimodal Large Language Models in Radiology for Report Generation and Visual Question Answering
Ziruo Yi, Ting Xiao, Mark V. Albert
Information (2025) Vol. 16, Iss. 2, pp. 136-136
Open Access
Ziruo Yi, Ting Xiao, Mark V. Albert
Information (2025) Vol. 16, Iss. 2, pp. 136-136
Open Access
Generative Models in Medical Visual Question Answering: A Survey
Wenjie Dong, Shuhao Shen, Yuqiang Han, et al.
Applied Sciences (2025) Vol. 15, Iss. 6, pp. 2983-2983
Open Access
Wenjie Dong, Shuhao Shen, Yuqiang Han, et al.
Applied Sciences (2025) Vol. 15, Iss. 6, pp. 2983-2983
Open Access
A Language-Guided Progressive Fusion Network with semantic density alignment for medical visual question answering
Shuxian Du, Shuang Liang, Yu Gu
Journal of Biomedical Informatics (2025), pp. 104811-104811
Closed Access
Shuxian Du, Shuang Liang, Yu Gu
Journal of Biomedical Informatics (2025), pp. 104811-104811
Closed Access
Pathologyvlm: a large vision-language model for pathology image understanding
Dawei Dai, Yuanhui Zhang, Qianlan Yang, et al.
Artificial Intelligence Review (2025) Vol. 58, Iss. 6
Open Access
Dawei Dai, Yuanhui Zhang, Qianlan Yang, et al.
Artificial Intelligence Review (2025) Vol. 58, Iss. 6
Open Access
Language Models Meet Anomaly Detection for Better Interpretability and Generalizability
Jun Li, Su Hwan Kim, Philip Müller, et al.
Lecture notes in computer science (2025), pp. 113-123
Closed Access
Jun Li, Su Hwan Kim, Philip Müller, et al.
Lecture notes in computer science (2025), pp. 113-123
Closed Access
TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-spoofing
Xudong Wang, Ke-Yue Zhang, Taiping Yao, et al.
Lecture notes in computer science (2024), pp. 148-168
Closed Access | Times Cited: 3
Xudong Wang, Ke-Yue Zhang, Taiping Yao, et al.
Lecture notes in computer science (2024), pp. 148-168
Closed Access | Times Cited: 3
Can LLMs’ Tuning Methods Work in Medical Multimodal Domain?
Jiawei Chen, Yue Jiang, Dingkang Yang, et al.
Lecture notes in computer science (2024), pp. 112-122
Closed Access | Times Cited: 3
Jiawei Chen, Yue Jiang, Dingkang Yang, et al.
Lecture notes in computer science (2024), pp. 112-122
Closed Access | Times Cited: 3
MISS: A Generative Pre-training and Fine-Tuning Approach for Med-VQA
Jiawei Chen, Dingkang Yang, Yue Jiang, et al.
Lecture notes in computer science (2024), pp. 299-313
Closed Access | Times Cited: 2
Jiawei Chen, Dingkang Yang, Yue Jiang, et al.
Lecture notes in computer science (2024), pp. 299-313
Closed Access | Times Cited: 2
RSMoDM: Multimodal Momentum Distillation Model for Remote Sensing Visual Question Answering
Pengfei Li, Gang Liu, Jinlong He, et al.
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2024) Vol. 17, pp. 16799-16814
Open Access | Times Cited: 1
Pengfei Li, Gang Liu, Jinlong He, et al.
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2024) Vol. 17, pp. 16799-16814
Open Access | Times Cited: 1
PECR:Parameter-Efficient Transfer Learning with Cross-Modal Representation Learning for Remote Sensing Visual Question Answering
Pengfei Li, Jinlong He, Gang Liu, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 6740-6744
Closed Access
Pengfei Li, Jinlong He, Gang Liu, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 6740-6744
Closed Access
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
Juntao Zhang, Yuehuai Liu, Yu‐Wing Tai, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 21, pp. 26876-26885
Closed Access
Juntao Zhang, Yuehuai Liu, Yu‐Wing Tai, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 21, pp. 26876-26885
Closed Access
Multimodal Foundation Models for Medical Imaging - A Systematic Review and Implementation Guidelines
Shih-Cheng Huang, Malte Jensen, Serena Yeung, et al.
medRxiv (Cold Spring Harbor Laboratory) (2024)
Open Access
Shih-Cheng Huang, Malte Jensen, Serena Yeung, et al.
medRxiv (Cold Spring Harbor Laboratory) (2024)
Open Access