
OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!
Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick, Yale Song, Sayan Nag, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 5262-5274
Open Access | Times Cited: 23
Showing 23 citing articles:
An Outlook into the Future of Egocentric Vision
Chiara Plizzari, Gabriele Goletto, Antonino Furnari, et al.
International Journal of Computer Vision (2024) Vol. 132, Iss. 11, pp. 4880-4936
Open Access | Times Cited: 12
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 19383-19400
Closed Access | Times Cited: 10
Video ReCap: Recursive Captioning of Hour-Long Videos
Md Mohaiminul Islam, Ngan Ho, Xitong Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18198-18208
Closed Access | Times Cited: 5
Fine-Tuning of 3D Hand Pose Estimation on HOI4D Dataset by Convolutional Neural Networks
Dinh Do Van, Van-Hung Le
Communications in computer and information science (2025), pp. 171-188
Closed Access
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Descriptions to Comprehensive Video Understanding
Liping Yuan, Jiawei Wang, Haomiao Sun, et al.
(2025)
Closed Access
Bootstrapping Vision-Language Models for Frequency-Centric Self-Supervised Remote Physiological Measurement
Zijie Yue, Miaojing Shi, Hanli Wang, et al.
International Journal of Computer Vision (2025)
Closed Access
Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction Anticipation
Razvan-George Pasca, Alexey Gavryushin, Muhammad Ameer Hamza, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18286-18296
Closed Access | Times Cited: 3
Step Differences in Instructional Video
Tushar Nagarajan, Lorenzo Torresani
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 18740-18750
Closed Access | Times Cited: 3
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Shraman Pramanick, Guangxing Han, Rui Hou, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 33, pp. 14076-14088
Closed Access | Times Cited: 3
TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-spoofing
Xudong Wang, Ke-Yue Zhang, Taiping Yao, et al.
Lecture notes in computer science (2024), pp. 148-168
Closed Access | Times Cited: 3
Visual-guided hierarchical iterative fusion for multi-modal video action recognition
Bingbing Zhang, Ying Zhang, Jianxin Zhang, et al.
Pattern Recognition Letters (2024)
Closed Access | Times Cited: 2
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
Anshul Shah, Benjamin Lundell, Harpreet Sawhney, et al.
2021 IEEE/CVF International Conference on Computer Vision (ICCV) (2023), pp. 10341-10353
Open Access | Times Cited: 6
Empirical study of 3D-HPE on HOI4D egocentric vision dataset based on deep learning
Van Hung Le
International Journal of Advances in Intelligent Informatics (2024) Vol. 10, Iss. 2, pp. 265-265
Open Access | Times Cited: 1
A Sound Approach: Using Large Language Models to Generate Audio Descriptions for Egocentric Text-Audio Retrieval
Andreea-Maria Oncescu, João F. Henriques, Andrew Zisserman, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2024), pp. 7300-7304
Open Access
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding
Minh Tran, Yelin Kim, Che-Chun Su, et al.
Lecture notes in computer science (2024), pp. 1-19
Closed Access
ViLA: Efficient Video-Language Alignment for Video Question Answering
Xijun Wang, Junbang Liang, Chun-Kai Wang, et al.
Lecture notes in computer science (2024), pp. 186-204
Closed Access
Every Shot Counts: Using Exemplars for Repetition Counting in Videos
Saptarshi Sinha, Alexandros Stergiou, Dima Damen
Lecture notes in computer science (2024), pp. 384-402
Closed Access
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
Simone Peirone, Francesca Pistilli, Antonio Alliegro, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024), pp. 18275-18285
Closed Access
Learning to Segment Referred Objects from Narrated Egocentric Videos
Yuhan Shen, Huiyu Wang, Xitong Yang, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2024) Vol. 35, pp. 14510-14520
Closed Access
PALM: Predicting Actions through Language Models
Sanghwan Kim, Daoji Huang, Yongqin Xian, et al.
Lecture notes in computer science (2024), pp. 140-158
Closed Access
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
Bolin Lai, Xiaoliang Dai, Lawrence R. Chen, et al.
Lecture notes in computer science (2024), pp. 135-155
Closed Access
AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction Anticipation
Lorenzo Mur-Labadia, Rubén Martínez-Cantín, J.J. Guerrero, et al.
Lecture notes in computer science (2024), pp. 167-184
Closed Access
EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
Thomas Hummel, Shyamgopal Karthik, Mariana-Iuliana Georgescu, et al.
Lecture notes in computer science (2024), pp. 1-17
Closed Access