Parallel Scheduling of Multiple SDF Graphs Onto Heterogeneous Processors

Dowhan Jeong,Jangryul Kim,Soonhoi Ha,Mari-Liis Oldja

doi:10.1109/access.2021.3054725

Abstract

Parallel scheduling of multiple real-time applications onto heterogeneous processors is needed in the emerging embedded systems such as self-driving cars, smart cameras, and smartphones. Assuming that an embedded application is specified as a synchronous dataflow (SDF) graph or its extension, we propose a novel parallel scheduling methodology based on an evolutionary algorithm where the mapping of tasks onto processors is evolved to optimize a given objective function in an iterative fashion. In each iteration, we use an existing worst-case response time (WCRT) analysis tool to check if all applications satisfy their real-time requirements by translating each SDF graph into a directed acyclic graph (DAG) that is assumed in the WCRT analysis tool. Since the WCRT analysis must be performed in each iteration of evolution, we propose a clustering technique to reduce drastically the analysis time that depends on the number of nodes and their dependency. We formally prove that the proposed clustering technique does not change the estimated WCRT of each application. The effectiveness of the proposed scheduling methodology with the clustering technique is verified with extensive experiments using real-life benchmarks, randomly generated graphs, and the comparison with the existing technique.

Highlights

To cope with the increasing user demand for compute-intensive deep learning applications, embedded systems tend to equip heterogeneous processing elements (PEs) that include a multi-core CPU, a GPU, and/or a deep learning accelerator called a Neural Processing Unit (NPU)
We assume that an embedded application is specified as a synchronous dataflow (SDF) [2] graph or its extension
Since the number of data samples consumed from each input port or produced to each output port per task execution is fixed in the SDF model, we can construct an execution schedule of tasks statically

Summary

INTRODUCTION

To cope with the increasing user demand for compute-intensive deep learning applications, embedded systems tend to equip heterogeneous processing elements (PEs) that include a multi-core CPU, a GPU, and/or a deep learning accelerator called a Neural Processing Unit (NPU). The key constraint for node clustering is not to change the real-time performance by considering all possible interference scenarios between applications for given mapping and scheduling information of applications This constraint makes the proposed clustering technique distinguished from existent SDF clustering techniques ( [18], [19]) that do not consider mapping and scheduling. A novel parallel scheduling technique based on an evolutionary algorithm is proposed to schedule multiple SDF graphs with diverse real-time characteristics onto heterogeneous PEs. For the performance evaluation of each mapping candidate, it uses an existing WCRT analysis tool. We formally prove that the proposed clustering technique does not change the real-time performance that is estimated by the WCRT analysis tool.

RELATED WORK

PARALLEL SCHEDULING METHODOLOGY

NODE CLUSTERING TECHNIQUE

SUPPORTING NON-PREEMPTIVE PROCESSING ELEMENTS

TIME COMPLEXITY

DEPENDENCY RELAXATION OPTIMIZATION

EXPERIMENT

Findings

VIII. CONCLUSION AND FUTURE WORK

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 37	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Parallel Scheduling of Multiple SDF Graphs Onto Heterogeneous Processors

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Worst-Case Response Time Analysis of a Synchronous Dataflow Graph in a Multiprocessor System with Real-Time Tasks
Junchul Choi ... Soonhoi Ha
ACM Transactions on Design Automation of Electronic Systems | VOL. 22
Junchul Choi, et. al.Junchul Choi ... Soonhoi Ha
20 Jan 2017
ACM Transactions on Design Automation of Electronic Systems | VOL. 22

Worst case response time approach evaluation for computing can messages response time in an automotive network
Saulo Marcos Torres De Carvalho ... Gustavo Lobato Campos
-
Saulo Marcos Torres De Carvalho, et. al.Saulo Marcos Torres De Carvalho ... Gustavo Lobato Campos
01 Nov 2017
01 Nov 2017

WCRT Analysis and Evaluation for Sporadic Message-Processing Tasks in Multicore Automotive Gateways
Guoqi Xie ... Renfa Li
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 38
Guoqi Xie, et. al.Guoqi Xie ... Renfa Li
01 Feb 2019
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 38

Worst Case Response Time Analysis of Sporadic Tasks with Precedence Constrained Subtasks Using Non-preemptive EDF Scheduling
Armaghan Darbandi ... Myung Kyun Kim
-
Armaghan Darbandi, et. al.Armaghan Darbandi ... Myung Kyun Kim
20 Nov 2012
20 Nov 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallel Scheduling of Multiple SDF Graphs Onto Heterogeneous Processors

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access