OpenAlex Citation Counts

OpenAlex Citations Logo

OpenAlex is a bibliographic catalogue of scientific papers, authors and institutions accessible in open access mode, named after the Library of Alexandria. It's citation coverage is excellent and I hope you will find utility in this listing of citing articles!

If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly at the bottom of the page, you'll find basic pagination options.

Requested Article:

Efficient Performance Prediction for Apache Spark
Guoli Cheng, Shi Ying, Bingming Wang, et al.
Journal of Parallel and Distributed Computing (2020) Vol. 149, pp. 40-51
Closed Access | Times Cited: 37

Showing 1-25 of 37 citing articles:

Adaptive incremental transfer learning for efficient performance modeling of big data workloads
Mariano Garralda-Barrio, Carlos Eiras‐Franco, Verónica Bolón‐Canedo
Future Generation Computer Systems (2025), pp. 107730-107730
Closed Access

Evaluation of Multiple Apache Spark Applications using Kubernetes as a Cluster manager on Google Cloud
M. Jayanthi, K. Ram Mohan Rao, Vuppala Sukanya
Procedia Computer Science (2025) Vol. 252, pp. 576-582
Open Access

Runtime prediction of big data jobs: performance comparison of machine learning algorithms and analytical models
N. Ahmed, Andre L. C. Barczak, Mohammad A. Rashid, et al.
Journal Of Big Data (2022) Vol. 9, Iss. 1
Open Access | Times Cited: 14

Large-Scale Music Genre Analysis and Classification Using Machine Learning with Apache Spark
Mousumi Chaudhury, Amin Karami, Mustansar Ali Ghazanfar
Electronics (2022) Vol. 11, Iss. 16, pp. 2567-2567
Open Access | Times Cited: 14

A novel framework for generic Spark workload characterization and similar pattern recognition using machine learning
Mariano Garralda-Barrio, Carlos Eiras‐Franco, Verónica Bolón‐Canedo
Journal of Parallel and Distributed Computing (2024) Vol. 189, pp. 104881-104881
Open Access | Times Cited: 2

A parallelization model for performance characterization of Spark Big Data jobs on Hadoop clusters
N. Ahmed, Andre L. C. Barczak, Mohammad A. Rashid, et al.
Journal Of Big Data (2021) Vol. 8, Iss. 1
Open Access | Times Cited: 17

Distributed three-way formal concept analysis for large formal contexts
Raghavendra Kumar Chunduri, Aswani Kumar Cherukuri
Journal of Parallel and Distributed Computing (2022) Vol. 171, pp. 141-156
Closed Access | Times Cited: 8

TurBO: A cost-efficient configuration-based auto-tuning approach for cluster-based big data frameworks
Hui Dou, Lei Zhang, Yiwen Zhang, et al.
Journal of Parallel and Distributed Computing (2023) Vol. 177, pp. 89-105
Closed Access | Times Cited: 4

A Novel Multi-Task Performance Prediction Model for Spark
Chao Shen, Chen Chen, Guozheng Rao
Applied Sciences (2023) Vol. 13, Iss. 22, pp. 12242-12242
Open Access | Times Cited: 4

A Novel Reinforcement Learning Approach for Spark Configuration Parameter Optimization
Xu Huang, Hong Zhang, Xiaomeng Zhai
Sensors (2022) Vol. 22, Iss. 15, pp. 5930-5930
Open Access | Times Cited: 6

An Energy-Aware Resource Management Strategy Based on Spark and YARN in Heterogeneous Environments
Fatemeh Shabestari, Nima Jafari Navimipour
IEEE Transactions on Green Communications and Networking (2023) Vol. 8, Iss. 2, pp. 635-644
Closed Access | Times Cited: 3

Usages of Spark Framework with Different Machine Learning Algorithms
Mohamed Ali Mohamed, Ibrahim Elhenawy, Ahmad Salah
Computational Intelligence and Neuroscience (2021) Vol. 2021, pp. 1-7
Open Access | Times Cited: 7

An Enhanced Parallelisation Model for Performance Prediction of Apache Spark on a Multinode Hadoop Cluster
N. Ahmed, Andre L. C. Barczak, Mohammad A. Rashid, et al.
Big Data and Cognitive Computing (2021) Vol. 5, Iss. 4, pp. 65-65
Open Access | Times Cited: 6

SPOAHA: Spark Program Optimizer Based on Artificial Hummingbird Algorithm
Miao Wang, Jiteng Zhen, Yupeng Ma, et al.
Lecture notes in computer science (2023), pp. 317-331
Closed Access | Times Cited: 2

Tuning parameters of Apache Spark with Gauss–Pareto-based multi-objective optimization
Muhammed Maruf Öztürk
Knowledge and Information Systems (2023)
Closed Access | Times Cited: 2

A Genetic Algorithm For Boolean Semiring Matrix Factorization With Applications To Graph Mining
Γεώργιος Δρακόπουλος, Phivos Mylonas
2021 IEEE International Conference on Big Data (Big Data) (2022), pp. 3864-3870
Closed Access | Times Cited: 4

PERIDOT: Modeling Execution Time of Spark Applications
Sarah Shah, Yasaman Amannejad, Diwakar Krishnamurthy, et al.
IEEE Open Journal of the Computer Society (2021) Vol. 2, pp. 346-359
Open Access | Times Cited: 5

Performance optimization of Spark MLlib workloads using cost efficient RICG model on exponential projective sampling
Piyush Sewal, Hari Singh
Cluster Computing (2024) Vol. 27, Iss. 8, pp. 10569-10588
Closed Access

PerfTop: Towards Performance Prediction of Distributed Learning over General Topology
Changzhi Yan, Zehan Zhu, Youcheng Niu, et al.
Journal of Parallel and Distributed Computing (2024) Vol. 192, pp. 104922-104922
Closed Access

Adaptive memory reservation strategy for heavy workloads in the Spark environment
Bohan Li, Xin He, Junyang Yu, et al.
PeerJ Computer Science (2024) Vol. 10, pp. e2460-e2460
Open Access

High-speed parallel segmentation algorithms of MeanShift for litchi canopies based on Spark and Hadoop
Hongyi Xiong, Jianhua Wang, Yiming Xiao, et al.
Advances in Complex Systems (2024) Vol. 15, Iss. 03
Closed Access

AMORA: An Advanced Malleable and Operational Framework for Performance Prediction of Big Data Systems
Weiwei Lin, Haojun Xu, Haocheng Zhong, et al.
Software Practice and Experience (2024)
Open Access

Page 1 - Next Page

Scroll to top