
OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you find this listing of citing articles useful!
Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this listing for that article. Lastly, basic pagination options are at the bottom of the page.
Requested Article:
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding, Zhuoyi Yang, Wenyi Hong, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 261
Showing 1-25 of 261 citing articles:
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 10674-10685
Open Access | Times Cited: 6226
Swin Transformer V2: Scaling Up Capacity and Resolution
Ze Liu, Han Hu, Yutong Lin, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 11999-12009
Open Access | Times Cited: 1231
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz, Yuanzhen Li, Varun Jampani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 22500-22510
Open Access | Times Cited: 1032
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1505-1518
Open Access | Times Cited: 772
A survey of transformers
Tianyang Lin, Yuxin Wang, Xiangyang Liu, et al.
AI Open (2022) Vol. 3, pp. 111-132
Open Access | Times Cited: 656
Pre-trained models: Past, present and future
Xu Han, Zhengyan Zhang, Ning Ding, et al.
AI Open (2021) Vol. 2, pp. 225-250
Open Access | Times Cited: 636
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami, Dani Lischinski, Ohad Fried
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 18187-18197
Open Access | Times Cited: 412
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu, Dong Chen, Jianmin Bao, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 10686-10696
Open Access | Times Cited: 375
Multimodal Learning With Transformers: A Survey
Peng Xu, Xiatian Zhu, David A. Clifton
IEEE Transactions on Pattern Analysis and Machine Intelligence (2023) Vol. 45, Iss. 10, pp. 12113-12132
Open Access | Times Cited: 343
T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou, Xintao Wang, Liangbin Xie, et al.
Proceedings of the AAAI Conference on Artificial Intelligence (2024) Vol. 38, Iss. 5, pp. 4296-4304
Open Access | Times Cited: 294
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li, Haotian Liu, Qingyang Wu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 22511-22521
Open Access | Times Cited: 227
The Power of Generative AI: A Review of Requirements, Models, Input–Output Formats, Evaluation Metrics, and Challenges
Ajay Bandi, Pydi Venkata Satya Ramesh Adapa, Yudu Eswar Vinay Pratap Kumar Kuchi
Future Internet (2023) Vol. 15, Iss. 8, pp. 260-260
Open Access | Times Cited: 226
Scaling up GANs for Text-to-Image Synthesis
Minguk Kang, Jun-Yan Zhu, Richard Zhang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 10124-10134
Open Access | Times Cited: 217
Pros and cons of GAN evaluation measures: New developments
Ali Borji
Computer Vision and Image Understanding (2021) Vol. 215, pp. 103329-103329
Open Access | Times Cited: 213
Zero-shot Image-to-Image Translation
Gaurav Parmar, Krishna Kumar Singh, Richard Zhang, et al.
(2023), pp. 1-11
Open Access | Times Cited: 191
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao, Hao Tang, Fei Wu, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 16494-16504
Open Access | Times Cited: 186
Compute Trends Across Three Eras of Machine Learning
Jaime Sevilla, Lennart Heim, Anson Ho, et al.
2022 International Joint Conference on Neural Networks (IJCNN) (2022), pp. 1-8
Open Access | Times Cited: 170
FILIP: Fine-grained Interactive Language-Image Pre-Training
Lewei Yao, Runhui Huang, Lu Hou, et al.
arXiv (Cornell University) (2021)
Open Access | Times Cited: 162
Blended Latent Diffusion
Omri Avrahami, Ohad Fried, Dani Lischinski
ACM Transactions on Graphics (2023) Vol. 42, Iss. 4, pp. 1-11
Open Access | Times Cited: 155
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Mingdeng Cao, Xintao Wang, Zhongang Qi, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 22503-22513
Open Access | Times Cited: 128
Towards Language-Free Training for Text-to-Image Generation
Yufan Zhou, Ruiyi Zhang, Changyou Chen, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 17886-17896
Closed Access | Times Cited: 101
Diffsound: Discrete Diffusion Model for Text-to-Sound Generation
Dongchao Yang, Jianwei Yu, Helin Wang, et al.
IEEE/ACM Transactions on Audio Speech and Language Processing (2023) Vol. 31, pp. 1720-1733
Open Access | Times Cited: 98
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei, Yabo Zhang, Zhilong Ji, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 15897-15907
Open Access | Times Cited: 98
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu, Jian Liang, Lei Ji, et al.
Lecture notes in computer science (2022), pp. 720-736
Open Access | Times Cited: 94
SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami, Thomas Hayes, Oran Gafni, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2023), pp. 18370-18380
Open Access | Times Cited: 83