Abstract

This paper presents a deep reinforcement learning (DRL)-based task scheduling algorithm for an FPGA-based real-time digital simulation (FRTDS) system, which generates arrangements that minimize the makespan of a task sequence under limited resources. The algorithm has two parts: synthetic cost construction and DRL processing to make arrangements. The synthetic cost captures the cost of each candidate arrangement in terms of both resource usage and the probability of blocking. This cost is used to estimate the state-action value function in a deep Q-network (DQN) procedure that generates an optimized scheduling strategy. We build the reinforcement learning strategy-generation process by instantiating the computing components in the hardware as agents, and the RAM resources and communication I/O ports as the environment. A hardware-design-based decision rule ensures that computing variables are distributed as evenly as possible across storage while making full use of the pipeline characteristics of the FPGA. A compiler is written to generate an FRTDS binary stream that drives the FRTDS. The accuracy and performance of the proposed method are verified and evaluated. We present simulation results for the proposed method and for a classic method; comparing them, the makespan obtained by the proposed method is significantly shorter, which translates into higher computing power and the ability to handle larger-scale real-time simulation.
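For a concrete picture of the loop the abstract describes, the following is a minimal sketch in Python. It stands in for the paper's DQN with a simple linear Q-approximation; the feature size, the hyperparameters, and the equal weighting inside `synthetic_cost` are illustrative assumptions, not values from the paper.

```python
import random
import numpy as np

# Hypothetical sketch of the DQN-style scheduling loop from the abstract:
# computing components act as agents, RAM banks and I/O ports form the
# environment, and the synthetic cost (resource usage plus blocking
# probability) serves as a negated reward. A linear Q-function stands in
# for the deep network; all names below are illustrative assumptions.

N_FEATURES = 8      # assumed size of the state-action feature vector
GAMMA = 0.9         # discount factor (assumed)
ALPHA = 0.01        # learning rate (assumed)
EPSILON = 0.1       # exploration rate (assumed)

w = np.zeros(N_FEATURES)  # weights of the linear Q-approximation

def q_value(features):
    """Approximate Q(s, a) from a joint state-action feature vector."""
    return float(w @ features)

def synthetic_cost(resource_usage, block_probability):
    """Combine resource usage and blocking probability into one cost.
    The equal weighting here is an assumption for illustration."""
    return resource_usage + block_probability

def select_task(candidate_features):
    """Epsilon-greedy selection over the ready tasks of one agent."""
    if random.random() < EPSILON:
        return random.randrange(len(candidate_features))
    return max(range(len(candidate_features)),
               key=lambda i: q_value(candidate_features[i]))

def td_update(features, cost, next_best_q):
    """One temporal-difference step: the reward is the negated cost."""
    global w
    target = -cost + GAMMA * next_best_q
    w += ALPHA * (target - q_value(features)) * features

# Example: one agent choosing between two ready tasks.
cands = [np.random.rand(N_FEATURES), np.random.rand(N_FEATURES)]
i = select_task(cands)
td_update(cands[i], synthetic_cost(0.3, 0.1), next_best_q=0.0)
```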

Highlights

  • Real-time simulation is of great significance to control system design, hardware equipment testing, and staff training

  • A low-cost real-time simulation system based on a digital signal processor (DSP) was built for educational purposes [1]; [2] emphasized the importance of analyzing microgrids with a real-time digital simulator (RTDS), and [3] built a co-simulation framework that can assess microgrids with hardware-in-the-loop testing approaches

  • We propose a reinforcement learning (RL)-based algorithm that takes resource usage as a parameter to describe the cost of task selection and uses the balanced storage of variables as its arrangement principle (a minimal sketch follows this list)
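As referenced in the last highlight, here is a minimal sketch of a balanced-storage rule: each new variable is placed in the least-occupied RAM bank so that variables stay evenly distributed. The bank model and function names are assumptions for illustration, not the paper's actual decision rule.

```python
# Illustrative sketch of a balanced-storage decision rule: assign each
# new variable to the RAM bank currently holding the fewest variables.
# The occupancy model is an assumption for demonstration only.

def place_variable(var_name, ram_occupancy):
    """Assign var_name to the RAM bank with the fewest stored variables.

    ram_occupancy: list mapping bank index -> current variable count.
    Returns the chosen bank index and updates the occupancy in place.
    """
    bank = min(range(len(ram_occupancy)), key=lambda b: ram_occupancy[b])
    ram_occupancy[bank] += 1
    return bank

# Example: distribute six variables over three banks.
occupancy = [0, 0, 0]
for v in ["v0", "v1", "v2", "v3", "v4", "v5"]:
    print(v, "-> bank", place_variable(v, occupancy))
# Each bank ends up holding two variables.
```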


Summary

INTRODUCTION

Real-time simulation is of great significance to control system design, hardware equipment testing, and staff training. We propose a reinforcement learning (RL)-based algorithm that takes resource usage as a parameter to describe the cost of task selection and uses the balanced storage of variables as its arrangement principle. According to the preceding analysis, blocks emerge when conflicts exist in read/write addresses or in communication, because full-state feedback on resources at a given future hardware clock cannot be obtained explicitly. Even if a variable is present in a RAM storage area before the specified time, that RAM may not belong to the private RAM of the current computing component. In this case, the current task selection requires communication to arrange subsequent tasks, which incurs an additional blocking cost. We add a maximum-epochs limit to stop the iteration in the case of exceptions.
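The following sketch illustrates the two mechanisms just described: charging an extra communication cost when a variable is resident only outside the current component's private RAM, and capping the outer iteration with a maximum-epochs limit. The constant `COMM_COST`, the data structures, and all function names are hypothetical assumptions, not the paper's implementation.

```python
# Hedged sketch of the blocking check described above: a variable may be
# resident in some RAM bank before the scheduled time, yet that bank may
# not be the private RAM of the selecting computing component, in which
# case a communication transfer (and its blocking cost) is charged.

MAX_EPOCHS = 1000   # assumed upper bound to stop iteration on exceptions
COMM_COST = 5.0     # assumed extra cost of an inter-component transfer

def blocking_cost(variable, component, resident_banks, private_banks):
    """Return the extra cost incurred if `component` must fetch
    `variable` from a bank outside its private RAM.

    resident_banks: dict mapping variable -> set of banks holding it.
    private_banks: dict mapping component -> set of its private banks.
    """
    banks_holding_var = resident_banks.get(variable, set())
    if banks_holding_var & private_banks[component]:
        return 0.0                    # locally readable, no block
    if banks_holding_var:
        return COMM_COST              # present, but a transfer is needed
    raise KeyError(f"{variable} not resident in any bank")

def iterate_schedule(step):
    """Run the scheduling iteration with a hard epoch limit."""
    for epoch in range(MAX_EPOCHS):
        if step(epoch):               # step() returns True when converged
            return epoch
    return MAX_EPOCHS                 # stopped by the limit

# Example: converge after a fixed number of dummy steps.
print(iterate_schedule(lambda epoch: epoch >= 10))  # prints 10
```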

REINFORCEMENT LEARNING BASED ON COST
DECISION RULES AND DECISION ORDER OF COMPUTING COMPONENTS
COMPUTING TASK OPTIMIZED SCHEDULING ALGORITHM BASED ON FRTDS
FRTDS EXAMPLE VERIFICATION AND RESULTS ANALYSIS
CONCLUSION
