
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Simple Open-Vocabulary Object Detection
Matthias Minderer, Alexey A. Gritsenko, Austin Stone, et al.
Lecture notes in computer science (2022), pp. 728-755
Closed Access | Times Cited: 109
Matthias Minderer, Alexey A. Gritsenko, Austin Stone, et al.
Lecture notes in computer science (2022), pp. 728-755
Closed Access | Times Cited: 109
Showing 1-25 of 109 citing articles:
Grounding DINO: Marrying DINO with Grounded Pre-training for Open-Set Object Detection
Shilong Liu, Zhaoyang Zeng, Tianhe Ren, et al.
Lecture notes in computer science (2024), pp. 38-55
Closed Access | Times Cited: 285
Shilong Liu, Zhaoyang Zeng, Tianhe Ren, et al.
Lecture notes in computer science (2024), pp. 38-55
Closed Access | Times Cited: 285
Vision-Language Models for Vision Tasks: A Survey
J Zhang, Jiaxing Huang, Sheng Jin, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5625-5644
Open Access | Times Cited: 111
J Zhang, Jiaxing Huang, Sheng Jin, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 8, pp. 5625-5644
Open Access | Times Cited: 111
ViperGPT: Visual Inference via Python Execution for Reasoning
Dídac Surís, Sachit Menon, Carl Vondrick
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 11854-11864
Open Access | Times Cited: 84
Dídac Surís, Sachit Menon, Carl Vondrick
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 11854-11864
Open Access | Times Cited: 84
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui, Yunsheng Ma, Xu Cao, et al.
(2024), pp. 958-979
Open Access | Times Cited: 83
Can Cui, Yunsheng Ma, Xu Cao, et al.
(2024), pp. 958-979
Open Access | Times Cited: 83
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai, Basil Mustafa, А. И. Колесников, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Open Access | Times Cited: 76
Xiaohua Zhai, Basil Mustafa, А. И. Колесников, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023)
Open Access | Times Cited: 76
TidyBot: personalized robot assistance with large language models
Jimmy Wu, Rika Antonova, Adam Kan, et al.
Autonomous Robots (2023) Vol. 47, Iss. 8, pp. 1087-1102
Closed Access | Times Cited: 59
Jimmy Wu, Rika Antonova, Adam Kan, et al.
Autonomous Robots (2023) Vol. 47, Iss. 8, pp. 1087-1102
Closed Access | Times Cited: 59
Semantic anomaly detection with large language models
Amine Elhafsi, Rohan Sinha, Christopher Agia, et al.
Autonomous Robots (2023) Vol. 47, Iss. 8, pp. 1035-1055
Closed Access | Times Cited: 32
Amine Elhafsi, Rohan Sinha, Christopher Agia, et al.
Autonomous Robots (2023) Vol. 47, Iss. 8, pp. 1035-1055
Closed Access | Times Cited: 32
LLM Multimodal Traffic Accident Forecasting
I. de Zarzà, J. de Curtò, Gemma Roig, et al.
Sensors (2023) Vol. 23, Iss. 22, pp. 9225-9225
Open Access | Times Cited: 28
I. de Zarzà, J. de Curtò, Gemma Roig, et al.
Sensors (2023) Vol. 23, Iss. 22, pp. 9225-9225
Open Access | Times Cited: 28
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie, Wei Li, Xiangtai Li, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 12
Jiahao Xie, Wei Li, Xiangtai Li, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 12
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
Xingxing Zuo, Pouya Samangouei, Yunwen Zhou, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 9
Xingxing Zuo, Pouya Samangouei, Yunwen Zhou, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 9
A survey on integration of large language models with intelligent robots
Yeseung Kim, Dohyun Kim, Ji‐Eun Choi, et al.
Intelligent Service Robotics (2024) Vol. 17, Iss. 5, pp. 1091-1107
Open Access | Times Cited: 9
Yeseung Kim, Dohyun Kim, Ji‐Eun Choi, et al.
Intelligent Service Robotics (2024) Vol. 17, Iss. 5, pp. 1091-1107
Open Access | Times Cited: 9
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han, Jiangning Zhang, Yabiao Wang, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 12, pp. 9221-9238
Open Access | Times Cited: 8
Yue Han, Jiangning Zhang, Yabiao Wang, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 12, pp. 9221-9238
Open Access | Times Cited: 8
Can an Embodied Agent Find Your “Cat-shaped Mug”? LLM-Based Zero-Shot Object Navigation
Vishnu Sashank Dorbala, James F. Mullen, Dinesh Manocha
IEEE Robotics and Automation Letters (2023) Vol. 9, Iss. 5, pp. 4083-4090
Open Access | Times Cited: 19
Vishnu Sashank Dorbala, James F. Mullen, Dinesh Manocha
IEEE Robotics and Automation Letters (2023) Vol. 9, Iss. 5, pp. 4083-4090
Open Access | Times Cited: 19
Going Denser with Open-Vocabulary Part Segmentation
Peize Sun, Shoufa Chen, Chenchen Zhu, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15407-15419
Open Access | Times Cited: 19
Peize Sun, Shoufa Chen, Chenchen Zhu, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15407-15419
Open Access | Times Cited: 19
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
Aviad Aberdam, David Bensaïd, Alona Golts, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 21649-21660
Open Access | Times Cited: 14
Aviad Aberdam, David Bensaïd, Alona Golts, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 21649-21660
Open Access | Times Cited: 14
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing
Chau Pham, Truong V. Vu, Khoi Nguyen
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 768-777
Open Access | Times Cited: 5
Chau Pham, Truong V. Vu, Khoi Nguyen
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 768-777
Open Access | Times Cited: 5
Clio: Real-Time Task-Driven Open-Set 3D Scene Graphs
Dominic Maggio, Yun Chang, Nathan Hughes, et al.
IEEE Robotics and Automation Letters (2024) Vol. 9, Iss. 10, pp. 8921-8928
Closed Access | Times Cited: 5
Dominic Maggio, Yun Chang, Nathan Hughes, et al.
IEEE Robotics and Automation Letters (2024) Vol. 9, Iss. 10, pp. 8921-8928
Closed Access | Times Cited: 5
YOLOFLY: A Consumer-Centric Framework for Efficient Object Detection in UAV Imagery
Pengwei Ma, Hongmei Fei, Dingyi Jia, et al.
Electronics (2025) Vol. 14, Iss. 3, pp. 498-498
Open Access
Pengwei Ma, Hongmei Fei, Dingyi Jia, et al.
Electronics (2025) Vol. 14, Iss. 3, pp. 498-498
Open Access
LLM Multi-agent Decision Optimization
J. de Curtò, I. de Zarzà, Carlos T. Calafate
Smart innovation, systems and technologies (2025), pp. 3-15
Closed Access
J. de Curtò, I. de Zarzà, Carlos T. Calafate
Smart innovation, systems and technologies (2025), pp. 3-15
Closed Access
A Review: One-Shot Object Detection Methods for Conditional Detection of Retail and Warehouse Products
Matthieu Desmarescaux, Wissam Kaddah, Ayman Alfalou, et al.
Neural Processing Letters (2025) Vol. 57, Iss. 2
Open Access
Matthieu Desmarescaux, Wissam Kaddah, Ayman Alfalou, et al.
Neural Processing Letters (2025) Vol. 57, Iss. 2
Open Access
SiamYOLOv8: a rapid conditional detection framework for one-shot object detection
Matthieu Desmarescaux, Wissam Kaddah, Ayman Alfalou, et al.
Applied Intelligence (2025) Vol. 55, Iss. 7
Closed Access
Matthieu Desmarescaux, Wissam Kaddah, Ayman Alfalou, et al.
Applied Intelligence (2025) Vol. 55, Iss. 7
Closed Access
Real-world robot applications of foundation models: a review
Kento Kawaharazuka, Tatsuya Matsushima, Andrew Gambardella, et al.
Advanced Robotics (2024) Vol. 38, Iss. 18, pp. 1232-1254
Open Access | Times Cited: 4
Kento Kawaharazuka, Tatsuya Matsushima, Andrew Gambardella, et al.
Advanced Robotics (2024) Vol. 38, Iss. 18, pp. 1232-1254
Open Access | Times Cited: 4
Vision-Language Pretraining for Variable-Shot Image Classification
Sotirios Papadopoulos, Konstantinos Ioannidis, Stefanos Vrochidis, et al.
Lecture notes in computer science (2025), pp. 283-297
Closed Access
Sotirios Papadopoulos, Konstantinos Ioannidis, Stefanos Vrochidis, et al.
Lecture notes in computer science (2025), pp. 283-297
Closed Access
Exploring Decision Transformer for Highway Automated Driving
Luca Forneris, Francesco Bellotti, Riccardo Berta, et al.
Lecture notes in electrical engineering (2025), pp. 123-130
Closed Access
Luca Forneris, Francesco Bellotti, Riccardo Berta, et al.
Lecture notes in electrical engineering (2025), pp. 123-130
Closed Access
Enhancing skin lesion classification: a CNN approach with human baseline comparison
Deep Ajabani, Zaffar Ahmed Shaikh, Amr Yousef, et al.
PeerJ Computer Science (2025) Vol. 11, pp. e2795-e2795
Open Access
Deep Ajabani, Zaffar Ahmed Shaikh, Amr Yousef, et al.
PeerJ Computer Science (2025) Vol. 11, pp. e2795-e2795
Open Access