
OpenAlex is an open-access bibliographic catalogue of scientific papers, authors and institutions, named after the Library of Alexandria. Its citation coverage is excellent, and I hope you will find this listing of citing articles useful!
If you click the article title, you'll navigate to the article, as listed in CrossRef. If you click the Open Access links, you'll navigate to the "best Open Access location". Clicking the citation count will open this listing for that article. Lastly, at the bottom of the page, you'll find basic pagination options.
Requested Article:
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland, Joost Broekens, Aske Plaat, et al.
Foundations and Trends® in Machine Learning (2023) Vol. 16, Iss. 1, pp. 1-118
Open Access | Times Cited: 304
Showing 26-50 of 304 citing articles:
Intelligent and efficient fiber allocation strategy based on the dueling-double-deep Q-network
Yong Zhang, Zhipeng Yuan, Jia Ding, et al.
Frontiers of Engineering Management (2025)
Closed Access
Reinforcement Learning-Assisted Ferroelectric Domain Wall Design Using a Machine Learning Phase-Field Surrogate
Kévin Alhada-Lahbabi, Damien Deleruyelle, Brice Gautier
ACS Applied Electronic Materials (2025)
Closed Access
Online reinforcement learning for a continuous space system with experimental validation
Oguzhan Dogru, Nathan Wieczorek, Kirubakaran Velswamy, et al.
Journal of Process Control (2021) Vol. 104, pp. 86-100
Closed Access | Times Cited: 37
Replay in minds and machines
Lennart Wittkuhn, Samson Chien, Sam Hall-McMaster, et al.
Neuroscience & Biobehavioral Reviews (2021) Vol. 129, pp. 367-388
Open Access | Times Cited: 34
MBRL-MC: An HVAC Control Approach via Combining Model-Based Deep Reinforcement Learning and Model Predictive Control
Liangliang Chen, Fei Meng, Ying Zhang
IEEE Internet of Things Journal (2022) Vol. 9, Iss. 19, pp. 19160-19173
Open Access | Times Cited: 23
Machine Learning With Computer Networks: Techniques, Datasets, and Models
Haitham Afifi, Sabrina Pochaba, Andreas Boltres, et al.
IEEE Access (2024) Vol. 12, pp. 54673-54720
Open Access | Times Cited: 4
Combining Software-Defined and Delay-Tolerant Networking Concepts With Deep Reinforcement Learning Technology to Enhance Vehicular Networks
Olivia Nakayima, Mostafa I. Soliman, Kazunori Ueda, et al.
IEEE Open Journal of Vehicular Technology (2024) Vol. 5, pp. 721-736
Open Access | Times Cited: 4
Reinforcement Learning for Autonomous Process Control in Industry 4.0: Advantages and Challenges
Nuria Nievas, Adela Pagès‐Bernaus, Francesc Bonada, et al.
Applied Artificial Intelligence (2024) Vol. 38, Iss. 1
Open Access | Times Cited: 4
KineNN: Kinematic Neural Network for inverse model policy based on homogeneous transformation matrix and dual quaternion
Mochammad Rizky Diprasetya, Johannes Pöppelbaum, Andreas Schwung
Robotics and Computer-Integrated Manufacturing (2025) Vol. 94, pp. 102945-102945
Open Access
Experimental validation of a semi-active fuzzy control strategy based on deep reinforcement learning for a piezoelectric smart isolation system
Tzu‐Kang Lin, Chandrasekhara Tappiti, Lyan‐Ywan Lu, et al.
Engineering Applications of Artificial Intelligence (2025) Vol. 144, pp. 110058-110058
Closed Access
Enhancing Music Audio Signal Recognition through CNN-BiLSTM Fusion with De-noising Autoencoder for Improved Performance
Xin Mao, Ye Tian, Tao Jin, et al.
Neurocomputing (2025), pp. 129607-129607
Closed Access
RL-EAR: reinforcement learning-based energy-aware routing for software-defined wireless sensor network
Abhishek Narwaria, Varsha Kumari, Arka Prokash Mazumdar
The Journal of Supercomputing (2025) Vol. 81, Iss. 3
Closed Access
Efficient Q-learning Hyperparameter Tuning Using FOX Optimization Algorithm
Mahmood A. Jumaah, Yossra H. Ali, Tarik A. Rashid
Results in Engineering (2025), pp. 104341-104341
Open Access
Federated learning based on dynamic hierarchical game incentives in Industrial Internet of Things
Y. A. Tang, Lina Ni, Jufeng Li, et al.
Advanced Engineering Informatics (2025) Vol. 65, pp. 103214-103214
Closed Access
Trajectory self-correction and uncertainty estimation for enhanced model-based policy optimization
Shan Zhong, Xin Du, Kaijian Xia, et al.
Expert Systems with Applications (2025), pp. 126993-126993
Closed Access
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-Inconsistent Problems
Nixie S. Lesmana, Chi Seng Pun
SIAM Journal on Financial Mathematics (2025) Vol. 16, Iss. 1, pp. 68-122
Closed Access
Exploring Ensemble Error Exploration for Unsupervised Reinforcement Learning
Nutsu Shiman, Artem Latyshev, Petr Kuderov, et al.
Studies in computational intelligence (2025), pp. 199-209
Closed Access
Intelligent games meeting with multi-agent deep reinforcement learning: a comprehensive review
Yiqin Wang, Yufeng Wang, Feng Tian, et al.
Artificial Intelligence Review (2025) Vol. 58, Iss. 6
Open Access
Optimization-based spectral end-to-end deep reinforcement learning for equity portfolio management
Pengrui Yu, Sixue Liu, Chengneng Jin, et al.
Pacific-Basin Finance Journal (2025), pp. 102746-102746
Closed Access
Machine learning approaches for active queue management: A survey, taxonomy, and future directions
Mohammad Parsa Toopchinezhad, Mahmood Ahmadi
Computer Networks (2025), pp. 111174-111174
Closed Access
Learning Policies for Automated Racing Using Vehicle Model Gradients
Nathan A. Spielberg, Maximilian Templer, John Subosits, et al.
IEEE Open Journal of Intelligent Transportation Systems (2023) Vol. 4, pp. 130-142
Open Access | Times Cited: 12
Memristive dynamics enabled neuromorphic computing systems
Bonan Yan, Yuchao Yang, Ru Huang
Science China Information Sciences (2023) Vol. 66, Iss. 10
Open Access | Times Cited: 12
Learning Interaction-Aware Motion Prediction Model for Decision-Making in Autonomous Driving
Zhiyu Huang, Haochen Liu, Jingda Wu, et al.
2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC) (2023)
Open Access | Times Cited: 12
Fast Human-in-the-Loop Control for HVAC Systems via Meta-Learning and Model-Based Offline Reinforcement Learning
Liangliang Chen, Fei Meng, Ying Zhang
IEEE Transactions on Sustainable Computing (2023) Vol. 8, Iss. 3, pp. 504-521
Closed Access | Times Cited: 11
Human-Like Decision-Making of Autonomous Vehicles in Dynamic Traffic Scenarios
Tangyike Zhang, Junxiang Zhan, Jiamin Shi, et al.
IEEE/CAA Journal of Automatica Sinica (2023) Vol. 10, Iss. 10, pp. 1905-1917
Closed Access | Times Cited: 11