
OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.
Requested Article:
Can Foundation Models Wrangle Your Data?
Avanika Narayan, Ines Chami, Laurel Orr, et al.
Proceedings of the VLDB Endowment (2022) Vol. 16, Iss. 4, pp. 738-746
Open Access | Times Cited: 57
Avanika Narayan, Ines Chami, Laurel Orr, et al.
Proceedings of the VLDB Endowment (2022) Vol. 16, Iss. 4, pp. 738-746
Open Access | Times Cited: 57
Showing 1-25 of 57 citing articles:
CancerGPT for few shot drug pair synergy prediction using large pretrained language models
Tianhao Li, Sandesh Shetty, Advaith Kamath, et al.
npj Digital Medicine (2024) Vol. 7, Iss. 1
Open Access | Times Cited: 38
Tianhao Li, Sandesh Shetty, Advaith Kamath, et al.
npj Digital Medicine (2024) Vol. 7, Iss. 1
Open Access | Times Cited: 38
Generative Pre-Trained Transformer (GPT) in Research: A Systematic Review on Data Augmentation
Fahim Sufi
Information (2024) Vol. 15, Iss. 2, pp. 99-99
Open Access | Times Cited: 25
Fahim Sufi
Information (2024) Vol. 15, Iss. 2, pp. 99-99
Open Access | Times Cited: 25
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes
Simran Arora, Brandon Yang, Sabri Eyuboglu, et al.
Proceedings of the VLDB Endowment (2023) Vol. 17, Iss. 2, pp. 92-105
Open Access | Times Cited: 24
Simran Arora, Brandon Yang, Sabri Eyuboglu, et al.
Proceedings of the VLDB Endowment (2023) Vol. 17, Iss. 2, pp. 92-105
Open Access | Times Cited: 24
Data cleaning and machine learning: a systematic literature review
Pierre-Olivier Côté, Amin Nikanjam, Nafisa Ahmed, et al.
Automated Software Engineering (2024) Vol. 31, Iss. 2
Closed Access | Times Cited: 12
Pierre-Olivier Côté, Amin Nikanjam, Nafisa Ahmed, et al.
Automated Software Engineering (2024) Vol. 31, Iss. 2
Closed Access | Times Cited: 12
Construction of Knowledge Graphs: Current State and Challenges
Marvin Hofer, Daniel Obraczka, Alieh Saeedi, et al.
Information (2024) Vol. 15, Iss. 8, pp. 509-509
Open Access | Times Cited: 8
Marvin Hofer, Daniel Obraczka, Alieh Saeedi, et al.
Information (2024) Vol. 15, Iss. 8, pp. 509-509
Open Access | Times Cited: 8
To prompt or not to prompt: Navigating the use of large language models for integrating and modeling heterogeneous data
Adel Remadi, Karim El Hage, Yasmina Hobeika, et al.
Data & Knowledge Engineering (2024) Vol. 152, pp. 102313-102313
Open Access | Times Cited: 7
Adel Remadi, Karim El Hage, Yasmina Hobeika, et al.
Data & Knowledge Engineering (2024) Vol. 152, pp. 102313-102313
Open Access | Times Cited: 7
Generalizable and scalable multistage biomedical concept normalization leveraging large language models
Nicholas J Dobbins
Research Synthesis Methods (2025), pp. 1-12
Open Access
Nicholas J Dobbins
Research Synthesis Methods (2025), pp. 1-12
Open Access
Had Enough of Experts? Quantitative Knowledge Retrieval From Large Language Models
David Selby, Yuichiro Iwashita, Kai Spriestersbach, et al.
Stat (2025) Vol. 14, Iss. 2
Open Access
David Selby, Yuichiro Iwashita, Kai Spriestersbach, et al.
Stat (2025) Vol. 14, Iss. 2
Open Access
Using ChatGPT for Entity Matching
Ralph Peeters, Christian Bizer
Communications in computer and information science (2023), pp. 221-230
Closed Access | Times Cited: 12
Ralph Peeters, Christian Bizer
Communications in computer and information science (2023), pp. 221-230
Closed Access | Times Cited: 12
DeepJoin: Joinable Table Discovery with Pre-Trained Language Models
Yuyang Dong, Chuan Xiao, Takuma Nozawa, et al.
Proceedings of the VLDB Endowment (2023) Vol. 16, Iss. 10, pp. 2458-2470
Open Access | Times Cited: 11
Yuyang Dong, Chuan Xiao, Takuma Nozawa, et al.
Proceedings of the VLDB Endowment (2023) Vol. 16, Iss. 10, pp. 2458-2470
Open Access | Times Cited: 11
Chorus: Foundation Models for Unified Data Discovery and Exploration
Moe Kayali, Anton Lykov, Ilias Fountalis, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 8, pp. 2104-2114
Open Access | Times Cited: 3
Moe Kayali, Anton Lykov, Ilias Fountalis, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 8, pp. 2104-2114
Open Access | Times Cited: 3
GPTuner: A Manual-Reading Database Tuning System via GPT-Guided Bayesian Optimization
Jiale Lao, Yibo Wang, Yufei Li, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 8, pp. 1939-1952
Open Access | Times Cited: 3
Jiale Lao, Yibo Wang, Yufei Li, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 8, pp. 1939-1952
Open Access | Times Cited: 3
Quality issues in machine learning software systems
Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, et al.
Empirical Software Engineering (2024) Vol. 29, Iss. 6
Closed Access | Times Cited: 3
Pierre-Olivier Côté, Amin Nikanjam, Rached Bouchoucha, et al.
Empirical Software Engineering (2024) Vol. 29, Iss. 6
Closed Access | Times Cited: 3
Cleenex: Support for User Involvement during an Iterative Data Cleaning Process
João L. M. Pereira, Manuel J. Fonseca, Antónia Lopes, et al.
Journal of Data and Information Quality (2024) Vol. 16, Iss. 1, pp. 1-26
Open Access | Times Cited: 2
João L. M. Pereira, Manuel J. Fonseca, Antónia Lopes, et al.
Journal of Data and Information Quality (2024) Vol. 16, Iss. 1, pp. 1-26
Open Access | Times Cited: 2
Addressing Data Scarcity in the Medical Domain: A GPT-Based Approach for Synthetic Data Generation and Feature Extraction
Fahim Sufi
Information (2024) Vol. 15, Iss. 5, pp. 264-264
Open Access | Times Cited: 2
Fahim Sufi
Information (2024) Vol. 15, Iss. 5, pp. 264-264
Open Access | Times Cited: 2
ArcheType: A Novel Framework for Open-Source Column Type Annotation Using Large Language Models
Benjamin Feuer, Yurong Liu, Chinmay Hegde, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 9, pp. 2279-2292
Open Access | Times Cited: 2
Benjamin Feuer, Yurong Liu, Chinmay Hegde, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 9, pp. 2279-2292
Open Access | Times Cited: 2
Can Large Language Models Predict Data Correlations from Column Names?
Immanuel Trummer
Proceedings of the VLDB Endowment (2023) Vol. 16, Iss. 13, pp. 4310-4323
Closed Access | Times Cited: 7
Immanuel Trummer
Proceedings of the VLDB Endowment (2023) Vol. 16, Iss. 13, pp. 4310-4323
Closed Access | Times Cited: 7
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
Haojun Xia, Zheng Zhen, Yuchao Li, et al.
Proceedings of the VLDB Endowment (2023) Vol. 17, Iss. 2, pp. 211-224
Open Access | Times Cited: 7
Haojun Xia, Zheng Zhen, Yuchao Li, et al.
Proceedings of the VLDB Endowment (2023) Vol. 17, Iss. 2, pp. 211-224
Open Access | Times Cited: 7
Repairing raw metadata for metadata management
Hiba Khalid, Esteban Zimányi
Information Systems (2024) Vol. 122, pp. 102344-102344
Closed Access | Times Cited: 1
Hiba Khalid, Esteban Zimányi
Information Systems (2024) Vol. 122, pp. 102344-102344
Closed Access | Times Cited: 1
Bard, ChatGPT and 3DGPT: A Scientometric Analysis of Generative AI Tools and Assessment of Implications for Mechanical Engineering Education
K.B. Mustapha, Eng Hwa Yap, Yousif Abdalla Abakr
(2024)
Open Access | Times Cited: 1
K.B. Mustapha, Eng Hwa Yap, Yousif Abdalla Abakr
(2024)
Open Access | Times Cited: 1
Toward generalizable structure‐based deep learning models for protein–ligand interaction prediction: Challenges and strategies
Seokhyun Moon, Wonho Zhung, Woo Youn Kim
Wiley Interdisciplinary Reviews Computational Molecular Science (2024) Vol. 14, Iss. 1
Closed Access | Times Cited: 1
Seokhyun Moon, Wonho Zhung, Woo Youn Kim
Wiley Interdisciplinary Reviews Computational Molecular Science (2024) Vol. 14, Iss. 1
Closed Access | Times Cited: 1
DATALORE: Can a Large Language Model Find All Lost Scrolls in a Data Repository?
Yuze Lou, Chuan Lei, Xiao Qin, et al.
2022 IEEE 38th International Conference on Data Engineering (ICDE) (2024), pp. 5170-5176
Closed Access | Times Cited: 1
Yuze Lou, Chuan Lei, Xiao Qin, et al.
2022 IEEE 38th International Conference on Data Engineering (ICDE) (2024), pp. 5170-5176
Closed Access | Times Cited: 1
Automating the Enterprise with Foundation Models
Michael Wornow, Avanika Narayan, Krista Opsahl-Ong, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 11, pp. 2805-2812
Closed Access | Times Cited: 1
Michael Wornow, Avanika Narayan, Krista Opsahl-Ong, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 11, pp. 2805-2812
Closed Access | Times Cited: 1
RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes
Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mohamed Y. Eltabakh, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 12, pp. 4421-4424
Closed Access | Times Cited: 1
Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mohamed Y. Eltabakh, et al.
Proceedings of the VLDB Endowment (2024) Vol. 17, Iss. 12, pp. 4421-4424
Closed Access | Times Cited: 1
OmniscientDB: A Large Language Model-Augmented DBMS That Knows What Other DBMSs Do Not Know
Matthias Urban, D. Nguyen, Carsten Binnig
(2023)
Open Access | Times Cited: 4
Matthias Urban, D. Nguyen, Carsten Binnig
(2023)
Open Access | Times Cited: 4