
OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao, Yongming Rao, Zuyan Liu, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5706-5716
Open Access | Times Cited: 72
Showing 1-25 of 72 citing articles:
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang, Bingyi Kang, Zilong Huang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 10371-10381
Closed Access | Times Cited: 148
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke, Anton Obukhov, Shengyu Huang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 2, pp. 9492-9502
Closed Access | Times Cited: 34
AttriDiffuser: Adversarially enhanced diffusion model for text-to-facial attribute image synthesis
Wenfeng Song, Zhongyong Ye, Meng Sun, et al.
Pattern Recognition (2025) Vol. 163, pp. 111447-111447
Closed Access | Times Cited: 1
Diffusion Model as Representation Learner
Xingyi Yang, Xinchao Wang
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 18892-18903
Open Access | Times Cited: 27
Deep Learning-Based Image and Video Inpainting: A Survey
Weize Quan, J. Chen, Yanli Liu, et al.
International Journal of Computer Vision (2024) Vol. 132, Iss. 7, pp. 2367-2400
Closed Access | Times Cited: 13
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yuping Duan, Xianda Guo, Zheng Zhu
Lecture notes in computer science (2024), pp. 432-449
Closed Access | Times Cited: 12
Text-Image Alignment for Diffusion-Based Perception
Neehar Kondapaneni, Markus Marks, Manuel Knott, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 13883-13893
Closed Access | Times Cited: 10
Application of Machine Vision Techniques in Low-Cost Devices to Improve Efficiency in Precision Farming
Juan Felipe Jaramillo-Hernández, Vicente Julián, Cédric Marco-Detchart, et al.
Sensors (2024) Vol. 24, Iss. 3, pp. 937-937
Open Access | Times Cited: 8
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Suraj Patni, Aradhye Agarwal, Chetan Arora
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 27, pp. 28285-28295
Closed Access | Times Cited: 8
DMSA-UNet: Dual Multi-Scale Attention makes UNet more strong for medical image segmentation
Xiang Li, Chong Fu, Qun Wang, et al.
Knowledge-Based Systems (2024) Vol. 299, pp. 112050-112050
Closed Access | Times Cited: 7
VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders
Xuyang Liu, Siteng Huang, Yachen Kang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 2765-2769
Open Access | Times Cited: 6
WorDepth: Variational Language Prior for Monocular Depth Estimation
Ziyao Zeng, Daniel Wang, Fengyu Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 27, pp. 9708-9719
Closed Access | Times Cited: 6
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Zhenyu Li, Shariq Farooq Bhat, Peter Wonka
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 44, pp. 10016-10025
Closed Access | Times Cited: 6
Text-to-Image Synthesis With Generative Models: Methods, Datasets, Performance Metrics, Challenges, and Future Direction
Sarah Alhabeeb, Amal A. Al-Shargabi
IEEE Access (2024) Vol. 12, pp. 24412-24427
Open Access | Times Cited: 5
Generative Text-to-Image Diffusion for Automated Map Production Based on Geosocial Media Data
Alexander Dunkel, Dirk Burghardt, Madalina Gugulica
KN - Journal of Cartography and Geographic Information (2024) Vol. 74, Iss. 1, pp. 3-15
Open Access | Times Cited: 5
Mask Grounding for Referring Image Segmentation
Yong Xien Chng, Henry Zheng, Yizeng Han, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 26563-26573
Closed Access | Times Cited: 5
Learning Vision from Models Rivals Learning Vision from Data
Yonglong Tian, Lijie Fan, Kaifeng Chen, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 15887-15898
Closed Access | Times Cited: 5
Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence
Junyi Zhang, Charles Herrmann, Junhwa Hur, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 2, pp. 3076-3085
Closed Access | Times Cited: 5
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li, Jingyi Lu, Kai Han, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 34, pp. 27548-27558
Closed Access | Times Cited: 5
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani, Amit Raj, Kevis-Kokitsi Maninis, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 1, pp. 21795-21806
Closed Access | Times Cited: 5
IEBins: Iterative Elastic Bins for Monocular Depth Estimation and Completion
Shuwei Shao, Zhongcai Pei, Weihai Chen, et al.
International Journal of Computer Vision (2024)
Closed Access | Times Cited: 5
Zero-Shot Co-Salient Object Detection Framework
Haoke Xiao, Lv Tang, Bo Li, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 4010-4014
Open Access | Times Cited: 4
PolyMaX: General Dense Prediction with Mask Transformer
Xuan Yang, Liangzhe Yuan, Michael J. Wilber, et al.
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2024), pp. 1039-1050
Open Access | Times Cited: 4
Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification
Jingyi Zhou, Jiamu Sheng, Peng Ye, et al.
IEEE Transactions on Geoscience and Remote Sensing (2024) Vol. 62, pp. 1-16
Open Access | Times Cited: 4
Learning Diffusion High-Quality Priors for Pan-sharpening: A Two-Stage Approach with Time-Aware Adapter Fine-Tuning
Yingying Wang, Yunlong Lin, Xuanhua He, et al.
IEEE Transactions on Geoscience and Remote Sensing (2025) Vol. 63, pp. 1-14
Closed Access