OpenAlex Citation Counts

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick, Yale Song, Sayan Nag, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5262-5274
Open Access | Times Cited: 23

Showing 23 citing articles:

An Outlook into the Future of Egocentric Vision
Chiara Plizzari, Gabriele Goletto, Antonino Furnari, et al.
International Journal of Computer Vision (2024) Vol. 132, Iss. 11, pp. 4880-4936
Open Access | Times Cited: 12

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 19383-19400
Closed Access | Times Cited: 10

Video ReCap: Recursive Captioning of Hour-Long Videos
Md Mohaiminul Islam, Ngan Ho, Xitong Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18198-18208
Closed Access | Times Cited: 5

Fine-Tuning of 3D Hand Pose Estimation on HOI4D Dataset by Convolutional Neural Networks
Dinh Do Van, Van-Hung Le
Communications in computer and information science (2025), pp. 171-188
Closed Access

Tarsier2: Advancing Large Vision-Language Models from Detailed Video Descriptions to Comprehensive Video Understanding
Liping Yuan, Jiawei Wang, Haomiao Sun, et al.
(2025)
Closed Access

Bootstrapping Vision-Language Models for Frequency-Centric Self-Supervised Remote Physiological Measurement
Zijie Yue, Miaojing Shi, Hanli Wang, et al.
International Journal of Computer Vision (2025)
Closed Access

Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
Razvan–George Pasca, Alexey Gavryushin, Muhammad Ameer Hamza, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. abs/2006.13256, pp. 18286-18296
Closed Access | Times Cited: 3

Step Differences in Instructional Video
Tushar Nagarajan, Lorenzo Torresani
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18740-18750
Closed Access | Times Cited: 3

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick, Guangxing Han, Rui Hou, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 14076-14088
Closed Access | Times Cited: 3

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-spoofing
Xudong Wang, Ke-Yue Zhang, Taiping Yao, et al.
Lecture notes in computer science (2024), pp. 148-168
Closed Access | Times Cited: 3

Visual-guided hierarchical iterative fusion for multi-modal video action recognition
Bingbing Zhang, Ying Zhang, Jianxin Zhang, et al.
Pattern Recognition Letters (2024)
Closed Access | Times Cited: 2

STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
Anshul Shah, Benjamin Lundell, Harpreet Sawhney, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 10341-10353
Open Access | Times Cited: 6

Empirical study of 3D-HPE on HOI4D egocentric vision dataset based on deep learning
Van Hung Le
International Journal of Advances in Intelligent Informatics (2024) Vol. 10, Iss. 2, pp. 265-265
Open Access | Times Cited: 1

A Sound Approach: Using Large Language Models to Generate Audio Descriptions for Egocentric Text-Audio Retrieval
Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 7300-7304
Open Access

Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding
Minh Tran, Yelin Kim, Che-Chun Su, et al.
Lecture notes in computer science (2024), pp. 1-19
Closed Access

ViLA: Efficient Video-Language Alignment for Video Question Answering
Xijun Wang, Junbang Liang, Chun-Kai Wang, et al.
Lecture notes in computer science (2024), pp. 186-204
Closed Access

Every Shot Counts: Using Exemplars for Repetition Counting in Videos
Saptarshi Sinha, Alexandros Stergiou, Dima Damen
Lecture notes in computer science (2024), pp. 384-402
Closed Access

A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
Simone Peirone, Francesca Pistilli, Antonio Alliegro, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18275-18285
Closed Access

Learning to Segment Referred Objects from Narrated Egocentric Videos
Yuhan Shen, Huiyu Wang, Xitong Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14510-14520
Closed Access

PALM: Predicting Actions through Language Models
Sanghwan Kim, Daoji Huang, Yongqin Xian, et al.
Lecture notes in computer science (2024), pp. 140-158
Closed Access

LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai, Xiaoliang Dai, Lawrence R. Chen, et al.
Lecture notes in computer science (2024), pp. 135-155
Closed Access

AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction Anticipation
Lorenzo Mur-Labadia, Rubén Martínez-Cantín, J.J. Guerrero, et al.
Lecture notes in computer science (2024), pp. 167-184
Closed Access

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
Thomas Hummel, Shyamgopal Karthik, Mariana-Iuliana Georgescu, et al.
Lecture notes in computer science (2024), pp. 1-17
Closed Access

Page 1

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-functional	1 year	The cookie is set by the GDPR Cookie Consent plugin to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Necessary" category .
cookielawinfo-checkbox-others	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.

Requested Article:

Showing 23 citing articles:

Your Privacy