OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

GRES: Generalized Referring Expression Segmentation
Chang Liu, Henghui Ding, Xudong Jiang
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 55

Showing 1-25 of 55 citing articles:

Transformer-Based Visual Segmentation: A Survey
Xiangtai Li, Henghui Ding, Haobo Yuan, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 12, pp. 10138-10163
Open Access | Times Cited: 70

LISA: Reasoning Segmentation via Large Language Model
Xin Lai, Zhuotao Tian, Yukang Chen, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 9579-9589
Closed Access | Times Cited: 63

Towards Open Vocabulary Learning: A Survey
Jianzong Wu, Xiangtai Li, Shilin Xu, et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence (2024) Vol. 46, Iss. 7, pp. 5092-5113
Open Access | Times Cited: 45

GLaMM: Pixel Grounding Large Multimodal Model
Hanoona Rasheed, Muhammad Maaz, Sahal Shaji, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 13009-13018
Closed Access | Times Cited: 26

Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
Chang Liu, Henghui Ding, Yulun Zhang, et al.
IEEE Transactions on Image Processing (2023) Vol. 32, pp. 3054-3065
Open Access | Times Cited: 31

MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding, Chang Liu, Shuting He, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 2694-2703
Open Access | Times Cited: 26

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng, Binxin Yang, Tiankai Hang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 12709-12720
Closed Access | Times Cited: 14

Toward Robust Referring Image Segmentation
Jianzong Wu, Xiangtai Li, Xia Li, et al.
IEEE Transactions on Image Processing (2024) Vol. 33, pp. 1782-1794
Closed Access | Times Cited: 13

PixelLM: Pixel Reasoning with Large Multimodal Model
Zhongwei Ren, Zhicheng Huang, Yunchao Wei, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 26364-26373
Closed Access | Times Cited: 11

Primitive Generation and Semantic-Related Alignment for Universal Zero-Shot Segmentation
Shuting He, Henghui Ding, Wei Jiang
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 11238-11247
Open Access | Times Cited: 17

GSVA: Generalized Segmentation via Multimodal Large Language Models
Zhuofan Xia, Dongchen Han, Yizeng Han, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 3858-3869
Closed Access | Times Cited: 8

SmartEdit: Exploring Complex Instruction-Based Image Editing with Multimodal Large Language Models
Yuzhou Huang, Liangbin Xie, Xintao Wang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 8362-8371
Closed Access | Times Cited: 7

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Hao Zhang, Hongyang Li, Feng Li, et al.
Lecture notes in computer science (2024), pp. 19-35
Closed Access | Times Cited: 6

Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
Shuting He, Henghui Ding, Wei Jiang
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023)
Open Access | Times Cited: 15

CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Shuyang Sun, Runjia Li, Philip H. S. Torr, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 32, pp. 13171-13182
Closed Access | Times Cited: 6

Mask Grounding for Referring Image Segmentation
Yong Xien Chng, Henry Zheng, Yizeng Han, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 26563-26573
Closed Access | Times Cited: 5

Towards Language-Guided Visual Recognition via Dynamic Convolutions
Gen Luo, Yiyi Zhou, Xiaoshuai Sun, et al.
International Journal of Computer Vision (2023) Vol. 132, Iss. 1, pp. 1-19
Open Access | Times Cited: 13

Referring Image Editing: Object-Level Image Editing via Referring Expressions
Chang Liu, Xiangtai Li, Henghui Ding
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 13128-13138
Closed Access | Times Cited: 4

Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
Shuting He, Henghui Ding
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 13332-13341
Closed Access | Times Cited: 4

Ipdm: identity preserving diffusion model for face sketch and photo synthesis
Duoxun Tang, Xinhang Jiang, Ying Zhang, et al.
Machine Vision and Applications (2025) Vol. 36, Iss. 2
Closed Access

Bidirectional cascaded multimodal attention for multiple choice visual question answering
Sushmita Upadhyay, Sanjaya Shankar Tripathy
Machine Vision and Applications (2025) Vol. 36, Iss. 2
Closed Access

Understand and Detect: Multi-step zero-shot detection with image-level specific prompt
Miaotian Guo, Kewei Wu, Zhuqing Jiang, et al.
Knowledge-Based Systems (2025) Vol. 311, pp. 113083-113083
Closed Access

Decoding before aligning: Scale-Adaptive Early-Decoding Transformer for visual grounding
Liuwu Li, Yi Cai, Jiexin Wang, et al.
Neurocomputing (2025), pp. 129756-129756
Closed Access

DM 2 RM: dual-mode multimodal ranking for target objects and receptacles based on open-vocabulary instructions
Ryosuke Korekata, Kanta Kaneda, Shunya Nagashima, et al.
Advanced Robotics (2025), pp. 1-16
Closed Access

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick, Guangxing Han, Rui Hou, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 14076-14088
Closed Access | Times Cited: 3

Page 1 - Next Page

Scroll to top