OpenAlex Citation Counts

OpenAlex is an openly accessible bibliographic catalogue of scientific papers, authors, and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.

Requested Article:

Can Foundation Models Wrangle Your Data?
Avanika Narayan, Ines Chami, Laurel Orr, et al.
Proceedings of the VLDB Endowment (2022) Vol. 16, Iss. 4, pp. 738-746
Open Access | Times Cited: 57

Showing 1-25 of 57 citing articles:

CancerGPT for few shot drug pair synergy prediction using large pretrained language models
Tianhao Li, Sandesh Shetty, Advaith Kamath, et al.
npj Digital Medicine (2024) Vol. 7, Iss. 1
Open Access | Times Cited: 38

Generative Pre-Trained Transformer (GPT) in Research: A Systematic Review on Data Augmentation
Fahim Sufi
Information (2024) Vol. 15, Iss. 2, pp. 99
Open Access | Times Cited: 25

Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Simran Arora, Brandon Yang, Sabri Eyuboglu, et al.
Proceedings of the VLDB Endowment (2023) Vol. 17, Iss. 2, pp. 92-105
Open Access | Times Cited: 24

Data cleaning and machine learning: a systematic literature review
Pierre-Olivier Côté, Amin Nikanjam, Nafisa Ahmed, et al.
Automated Software Engineering (2024) Vol. 31, Iss. 2
Closed Access | Times Cited: 12

Construction of Knowledge Graphs: Current State and Challenges
Marvin Hofer, Daniel Obraczka, Alieh Saeedi, et al.
Information (2024) Vol. 15, Iss. 8, pp. 509
Open Access | Times Cited: 8

To prompt or not to prompt: Navigating the use of large language models for integrating and modeling heterogeneous data
Adel Remadi, Karim El Hage, Yasmina Hobeika, et al.
Data & Knowledge Engineering (2024) Vol. 152, pp. 102313
Open Access | Times Cited: 7

Generalizable and scalable multistage biomedical concept normalization leveraging large language models
Nicholas J Dobbins
Research Synthesis Methods (2025), pp. 1-12
Open Access

Had Enough of Experts? Quantitative Knowledge Retrieval From Large Language Models
David Selby, Yuichiro Iwashita, Kai Spriestersbach, et al.
Stat (2025) Vol. 14, Iss. 2
Open Access

Using ChatGPT for Entity Matching
Ralph Peeters, Christian Bizer
Communications in computer and information science (2023), pp. 221-230
Closed Access | Times Cited: 12

DeepJoin: Joinable Table Discovery with Pre-Trained Language Models
Yuyang Dong, Chuan Xiao, Takuma Nozawa, et al.
Proceedings of the VLDB Endowment (2023) Vol. 16, Iss. 10, pp. 2458-2470
Open Access | Times Cited: 11

Chorus: Foundation Models for Unified Data Discovery and Exploration
Moe Kayali, Anton Lykov, Ilias Fountalis, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 8, pp. 2104-2114
Open Access | Times Cited: 3

GPTuner: A Manual-Reading Database Tuning System via GPT-Guided Bayesian Optimization
Jiale Lao, Yibo Wang, Yufei Li, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 8, pp. 1939-1952
Open Access | Times Cited: 3

Quality issues in machine learning software systems
Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, et al.
Empirical Software Engineering (2024) Vol. 29, Iss. 6
Closed Access | Times Cited: 3

Cleenex: Support for User Involvement during an Iterative Data Cleaning Process
João L. M. Pereira, Manuel J. Fonseca, Antónia Lopes, et al.
Journal of Data and Information Quality (2024) Vol. 16, Iss. 1, pp. 1-26
Open Access | Times Cited: 2

Addressing Data Scarcity in the Medical Domain: A GPT-Based Approach for Synthetic Data Generation and Feature Extraction
Fahim Sufi
Information (2024) Vol. 15, Iss. 5, pp. 264
Open Access | Times Cited: 2

ArcheType: A Novel Framework for Open-Source Column Type Annotation Using Large Language Models
Benjamin Feuer, Yurong Liu, Chinmay Hegde, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 9, pp. 2279-2292
Open Access | Times Cited: 2

Can Large Language Models Predict Data Correlations from Column Names?
Immanuel Trummer
Proceedings of the VLDB Endowment (2023) Vol. 16, Iss. 13, pp. 4310-4323
Closed Access | Times Cited: 7

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
Haojun Xia, Zheng Zhen, Yuchao Li, et al.
Proceedings of the VLDB Endowment (2023) Vol. 17, Iss. 2, pp. 211-224
Open Access | Times Cited: 7

Repairing raw metadata for metadata management
Hiba Khalid, Esteban Zimányi
Information Systems (2024) Vol. 122, pp. 102344
Closed Access | Times Cited: 1

Bard, ChatGPT and 3DGPT: A Scientometric Analysis of Generative AI Tools and Assessment of Implications for Mechanical Engineering Education
K.B. Mustapha, Eng Hwa Yap, Yousif Abdalla Abakr
(2024)
Open Access | Times Cited: 1

Toward generalizable structure‐based deep learning models for protein–ligand interaction prediction: Challenges and strategies
Seokhyun Moon, Wonho Zhung, Woo Youn Kim
Wiley Interdisciplinary Reviews Computational Molecular Science (2024) Vol. 14, Iss. 1
Closed Access | Times Cited: 1

DATALORE: Can a Large Language Model Find All Lost Scrolls in a Data Repository?
Yuze Lou, Chuan Lei, Xiao Qin, et al.
2022 IEEE 38th International Conference on Data Engineering (ICDE) (2024), pp. 5170-5176
Closed Access | Times Cited: 1

Automating the Enterprise with Foundation Models
Michael Wornow, Avanika Narayan, Krista Opsahl-Ong, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 11, pp. 2805-2812
Closed Access | Times Cited: 1

RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes
Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mohamed Y. Eltabakh, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 12, pp. 4421-4424
Closed Access | Times Cited: 1

OmniscientDB: A Large Language Model-Augmented DBMS That Knows What Other DBMSs Do Not Know
Matthias Urban, D. Nguyen, Carsten Binnig
(2023)
Open Access | Times Cited: 4

Page 1 - Next Page

