
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
Andy T. Liu, Shang-Wen Li, Hung-yi Lee
IEEE/ACM Transactions on Audio Speech and Language Processing (2021) Vol. 29, pp. 2351-2366
Open Access | Times Cited: 267
Andy T. Liu, Shang-Wen Li, Hung-yi Lee
IEEE/ACM Transactions on Audio Speech and Language Processing (2021) Vol. 29, pp. 2351-2366
Open Access | Times Cited: 267
Showing 1-25 of 267 citing articles:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1505-1518
Open Access | Times Cited: 763
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1505-1518
Open Access | Times Cited: 763
SUPERB: Speech Processing Universal PERformance Benchmark
Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, et al.
Interspeech 2022 (2021)
Open Access | Times Cited: 463
Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, et al.
Interspeech 2022 (2021)
Open Access | Times Cited: 463
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 15617-15629
Open Access | Times Cited: 296
Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022), pp. 15617-15629
Open Access | Times Cited: 296
Self-Supervised Speech Representation Learning: A Review
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1179-1210
Open Access | Times Cited: 204
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1179-1210
Open Access | Times Cited: 204
w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
Yu-An Chung, Yu Zhang, Wei Han, et al.
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2021)
Open Access | Times Cited: 184
Yu-An Chung, Yu Zhang, Wei Han, et al.
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2021)
Open Access | Times Cited: 184
A review of deep learning techniques for speech processing
Ambuj Mehrish, Navonil Majumder, Rishabh Bharadwaj, et al.
Information Fusion (2023) Vol. 99, pp. 101869-101869
Open Access | Times Cited: 143
Ambuj Mehrish, Navonil Majumder, Rishabh Bharadwaj, et al.
Information Fusion (2023) Vol. 99, pp. 101869-101869
Open Access | Times Cited: 143
Transfer learning based physics-informed neural networks for solving inverse problems in engineering structures under different loading scenarios
Xu Chen, Ba Trung Cao, Yong Yuan, et al.
Computer Methods in Applied Mechanics and Engineering (2022) Vol. 405, pp. 115852-115852
Open Access | Times Cited: 114
Xu Chen, Ba Trung Cao, Yong Yuan, et al.
Computer Methods in Applied Mechanics and Engineering (2022) Vol. 405, pp. 115852-115852
Open Access | Times Cited: 114
Speech Emotion Recognition Using Self-Supervised Features
Edmilson Morais, Ron Hoory, Weizhong Zhu, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 6922-6926
Open Access | Times Cited: 79
Edmilson Morais, Ron Hoory, Weizhong Zhu, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 6922-6926
Open Access | Times Cited: 79
Audio self-supervised learning: A survey
Shuo Liu, Adria Mallol-Ragolta, Emilia Parada‐Cabaleiro, et al.
Patterns (2022) Vol. 3, Iss. 12, pp. 100616-100616
Open Access | Times Cited: 74
Shuo Liu, Adria Mallol-Ragolta, Emilia Parada‐Cabaleiro, et al.
Patterns (2022) Vol. 3, Iss. 12, pp. 100616-100616
Open Access | Times Cited: 74
Distilhubert: Speech Representation Learning by Layer-Wise Distillation of Hidden-Unit Bert
Heng-Jui Chang, Shu-Wen Yang, Hung-yi Lee
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 69
Heng-Jui Chang, Shu-Wen Yang, Hung-yi Lee
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 69
Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data Classification
Meng Wang, Feng Gao, Junyu Dong, et al.
IEEE Transactions on Geoscience and Remote Sensing (2023) Vol. 61, pp. 1-16
Open Access | Times Cited: 48
Meng Wang, Feng Gao, Junyu Dong, et al.
IEEE Transactions on Geoscience and Remote Sensing (2023) Vol. 61, pp. 1-16
Open Access | Times Cited: 48
Keyword Transformer: A Self-Attention Model for Keyword Spotting
Axel Berg, Mark O’Connor, Miguel Tairum Cruz
Interspeech 2022 (2021)
Open Access | Times Cited: 92
Axel Berg, Mark O’Connor, Miguel Tairum Cruz
Interspeech 2022 (2021)
Open Access | Times Cited: 92
A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Yingzhi Wang, Abdelmoumene Boumadane, Abdelwahab Heba
arXiv (Cornell University) (2021)
Open Access | Times Cited: 81
Yingzhi Wang, Abdelmoumene Boumadane, Abdelwahab Heba
arXiv (Cornell University) (2021)
Open Access | Times Cited: 81
Assessing the State of Self-Supervised Human Activity Recognition Using Wearables
Harish Haresamudram, Irfan Essa, Thomas Plötz
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (2022) Vol. 6, Iss. 3, pp. 1-47
Open Access | Times Cited: 60
Harish Haresamudram, Irfan Essa, Thomas Plötz
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (2022) Vol. 6, Iss. 3, pp. 1-47
Open Access | Times Cited: 60
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, et al.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2022)
Open Access | Times Cited: 53
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, et al.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2022)
Open Access | Times Cited: 53
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training
Sanyuan Chen, Yu Wu, Chengyi Wang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 47
Sanyuan Chen, Yu Wu, Chengyi Wang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 47
COCOA
Shohreh Deldari, Hao Xue, Aaqib Saeed, et al.
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (2022) Vol. 6, Iss. 3, pp. 1-28
Open Access | Times Cited: 45
Shohreh Deldari, Hao Xue, Aaqib Saeed, et al.
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (2022) Vol. 6, Iss. 3, pp. 1-28
Open Access | Times Cited: 45
Speaker Normalization for Self-Supervised Speech Emotion Recognition
Itai Gat, Hagai Aronowitz, Weizhong Zhu, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 43
Itai Gat, Hagai Aronowitz, Weizhong Zhu, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 43
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 7167-7171
Open Access | Times Cited: 42
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 7167-7171
Open Access | Times Cited: 42
Improving Automatic Speech Recognition Performance for Low-Resource Languages With Self-Supervised Models
Jing Zhao, Wei-Qiang Zhang
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1227-1241
Closed Access | Times Cited: 39
Jing Zhao, Wei-Qiang Zhang
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1227-1241
Closed Access | Times Cited: 39
STGATE: Spatial-temporal graph attention network with a transformer encoder for EEG-based emotion recognition
Jingcong Li, Weijian Pan, Haiyun Huang, et al.
Frontiers in Human Neuroscience (2023) Vol. 17
Open Access | Times Cited: 30
Jingcong Li, Weijian Pan, Haiyun Huang, et al.
Frontiers in Human Neuroscience (2023) Vol. 17
Open Access | Times Cited: 30
SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data
Ziqiang Zhang, Sanyuan Chen, Long Zhou, et al.
IEEE/ACM Transactions on Audio Speech and Language Processing (2024) Vol. 32, pp. 2177-2187
Open Access | Times Cited: 14
Ziqiang Zhang, Sanyuan Chen, Long Zhou, et al.
IEEE/ACM Transactions on Audio Speech and Language Processing (2024) Vol. 32, pp. 2177-2187
Open Access | Times Cited: 14
A Survey on Time-Series Pre-Trained Models
Qianli Ma, Zhen Liu, Zhenjing Zheng, et al.
IEEE Transactions on Knowledge and Data Engineering (2024) Vol. 36, Iss. 12, pp. 7536-7555
Open Access | Times Cited: 13
Qianli Ma, Zhen Liu, Zhenjing Zheng, et al.
IEEE Transactions on Knowledge and Data Engineering (2024) Vol. 36, Iss. 12, pp. 7536-7555
Open Access | Times Cited: 13
Multi-agent deep reinforcement learning-based autonomous decision-making framework for community virtual power plants
Xiangyu Li, Fengji Luo, Chaojie Li
Applied Energy (2024) Vol. 360, pp. 122813-122813
Open Access | Times Cited: 8
Xiangyu Li, Fengji Luo, Chaojie Li
Applied Energy (2024) Vol. 360, pp. 122813-122813
Open Access | Times Cited: 8
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization
Shaoshi Ling, Yuzong Liu
arXiv (Cornell University) (2020)
Open Access | Times Cited: 61
Shaoshi Ling, Yuzong Liu
arXiv (Cornell University) (2020)
Open Access | Times Cited: 61