OpenAlex Citation Counts

OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!

Clicking an article title takes you to the article as listed in CrossRef. Clicking an Open Access link takes you to the "best Open Access location". Clicking a citation count opens this same listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.

Requested Article:

Vector-Quantized Neural Networks for Acoustic Unit Discovery in the ZeroSpeech 2020 Challenge
Benjamin van Niekerk, Leanne Nortje, Herman Kamper
Interspeech 2022 (2020), pp. 4836-4840
Open Access | Times Cited: 109

Showing 1-25 of 109 citing articles:

Self-Supervised Speech Representation Learning: A Review
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, et al.
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1179-1210
Open Access | Times Cited: 205

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations
Adam Polyak, Yossi Adi, Jade Copet, et al.
Interspeech 2022 (2021)
Open Access | Times Cited: 153

Unsupervised Automatic Speech Recognition: A review
Hanan Aldarmaki, Asad Ullah, Sreepratha Ram, et al.
Speech Communication (2022) Vol. 139, pp. 76-91
Open Access | Times Cited: 54

A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Benjamin van Niekerk, Marc‐André Carbonneau, Julian Zaïdi, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022)
Open Access | Times Cited: 53

AVQVC: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning
Huaizhen Tang, Xulong Zhang, Jianzong Wang, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 4613-4617
Open Access | Times Cited: 38

Many but not all deep neural network audio models capture brain responses and exhibit correspondence between model stages and brain regions
Greta Tuckute, Jenelle Feather, Dana Boebinger, et al.
PLoS Biology (2023) Vol. 21, Iss. 12, e3002366
Open Access | Times Cited: 25

Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques
Grzegorz Chrupała
Journal of Artificial Intelligence Research (2022) Vol. 73, pp. 673-707
Open Access | Times Cited: 36

SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning
Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, et al.
2022 IEEE Spoken Language Technology Workshop (SLT) (2023), pp. 1096-1103
Open Access | Times Cited: 17

Do Infants Really Learn Phonetic Categories?
Naomi H. Feldman, Sharon Goldwater, Emmanuel Dupoux, et al.
Open Mind (2021) Vol. 5, pp. 113-131
Open Access | Times Cited: 34

A whole brain probabilistic generative model: Toward realizing cognitive architectures for developmental robots
Tadahiro Taniguchi, Hiroshi Yamakawa, Takayuki Nagai, et al.
Neural Networks (2022) Vol. 150, pp. 293-312
Open Access | Times Cited: 24

Towards Unsupervised Phone and Word Segmentation Using Self-Supervised Vector-Quantized Neural Networks
Herman Kamper, Benjamin van Niekerk
Interspeech 2022 (2021), pp. 1539-1543
Open Access | Times Cited: 30

Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, et al.
Interspeech 2022 (2021), pp. 366-370
Open Access | Times Cited: 28

Employing Chroma-gram techniques for audio source separation in human-computer interaction
Liqaa Fadil, Alia Karim Abdul Hassan, Hiba B. Alwan
AIP Conference Proceedings (2025) Vol. 3264, 030024
Closed Access

USIAL-VC: A One-Shot Voice Conversion by U-Net-Based Encoder and Speaker Identity Adaptive Learning
Yujiang Peng, Yutian Wang
Communications in computer and information science (2025), pp. 134-145
Closed Access

Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding
Saurabhchand Bhati, Jesús Villalba, Piotr Żelasko, et al.
IEEE/ACM Transactions on Audio Speech and Language Processing (2022) Vol. 30, pp. 2002-2014
Open Access | Times Cited: 17

Self-Supervised Language Learning From Raw Audio: Lessons From the Zero Resource Speech Challenge
Ewan Dunbar, Nicolas Hamilakis, Emmanuel Dupoux
IEEE Journal of Selected Topics in Signal Processing (2022) Vol. 16, Iss. 6, pp. 1211-1226
Open Access | Times Cited: 17

Unsupervised Speech Recognition
Alexei Baevski, Wei-Ning Hsu, Alexis Conneau, et al.
arXiv (Cornell University) (2021)
Closed Access | Times Cited: 22

Adversarially Learning Disentangled Speech Representations for Robust Multi-Factor Voice Conversion
Jie Wang, Jingbei Li, Xintao Zhao, et al.
Interspeech 2022 (2021)
Open Access | Times Cited: 22

Word Segmentation on Discovered Phone Units With Dynamic Programming and Self-Supervised Scoring
Herman Kamper
IEEE/ACM Transactions on Audio Speech and Language Processing (2022) Vol. 31, pp. 684-694
Open Access | Times Cited: 15

StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer From Style-Based TTS Models
Yinghao Aaron Li, Cong Han, Nima Mesgarani
2022 IEEE Spoken Language Technology Workshop (SLT) (2023), pp. 920-927
Open Access | Times Cited: 8

The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units
Ewan Dunbar, Julien Karadayi, Mathieu Bernard, et al.
Interspeech 2022 (2020), pp. 4831-4835
Open Access | Times Cited: 23

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
Wonjune Kang, Mark Hasegawa–Johnson, Deb Roy
Interspeech 2022 (2023), pp. 2303-2307
Closed Access | Times Cited: 7

VCVTS: Multi-Speaker Video-to-Speech Synthesis Via Cross-Modal Knowledge Transfer from Voice Conversion
Disong Wang, Shan Yang, Dan Su, et al.
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2022), pp. 7252-7256
Open Access | Times Cited: 11

Page 1 - Next Page