Scheduling Overhead Research Articles

Recently, various infrastructure-assisted or onboard driving assistant applications have been proposed as a component of intelligent transportation systems (ITS) to improve the transportation system’s efficiency and release public concern about road safety. However, such AI-assisted intelligent applications are mainly data-driven and put great demands on the computing power of the ITS systems. Therefore, in the highly dynamic Internet-of-Vehicles environment in ITS, how to effectively coordinate the limited computing power of the various components of the system and realize reliable support for such resource-consuming applications through efficient resource allocation methods is the focus of our research. Accordingly, a novel joint computing and communication resource scheduling method is proposed to fulfill those ITS applications’ inherent heterogeneous quality of service (QoS) requirements. By fully exploiting the computing resources provided by the onboard computing device, the edge computing device located in the vehicle’s proximity and remote data center, we designed a hierarchical three-layer Vehicular Edge Computing (VEC) framework. Briefly, an onboard joint computation offloading and transmission scheduling policy is designed to assign corresponding offloading decisions to the locally generated computing tasks by considering the vehicle’s computing resources and real-time network link status. Additionally, a new distributed resource allocation policy is developed for the edge devices, in which we derive a server selection policy and allocate communication time based on our proposed metric. To evaluate the performance and validate the effectiveness of our proposed method, we conduct extensive simulation tests and ablation experiments, respectively. The results show that our approach can achieve stable performance in various experimental settings. Also, compared to the state-of-the-art algorithms, our joint resource allocation policy significantly reduces the scheduling overhead, improves the utilization of system resources, and minimizes the data transmission delay caused by vehicle motion.

Read full abstract

With the widespread use of real-time data analysis by artificial intelligence (AI), the integration of accelerators is attracting attention from the perspectives of their low power consumption and low latency. The objective of this research is to increase accelerator resource efficiency and further reduce power consumption by sharing accelerators among multiple users while maintaining real-time performance. To achieve the accelerator-sharing system, we define three requirements: high device utilization, fair device utilization among users, and real-time performance. Targeting the AI inference use case, this paper proposes a system that shares a field-programmable gate array (FPGA) among multiple users by switching the convolutional neural network (CNN) models stored in the device memory on the FPGA, while satisfying the three requirements. The proposed system uses different behavioral models for workloads with predictable and unpredictable data arrival timing. For the workloads with predictable data arrival timing, the system uses spatial-division multiplexing of the FPGA device memory to achieve real-time performance and high device utilization. Specifically, the FPGA device memory controller of the system transparently preloads and caches the CNN models into the FPGA device memory before the data arrival. For workloads with unpredictable data arrival timing, the system transfers CNN models to the FPGA device memory upon data arrival using time-division multiplexing of FPGA device memory. In the latter case of unpredictable workloads, the switch cost between CNN models is non-negligible to achieve real-time performance and high device utilization, so the system integrates a new scheduling algorithm that considers the switch time of the CNN models. For both predictable and unpredictable workloads, user fairness is achieved by using an ageing technique in the scheduling algorithm that increases the priority of jobs in accordance with the job waiting time. The evaluation results show that the scheduling overhead of the proposed system is negligible for both predictable and unpredictable workloads providing practical real-time performance. For unpredictable workloads, the new scheduling algorithm improves fairness by 24%–94% and resource efficiency by 31%–33% compared to traditional algorithms using first-come first-served or round-robin. For predictable workloads, the system improves fairness by 50.5 % compared to first-come first-served and achieves 99.5 % resource efficiency.

Read full abstract

Scheduling Overhead Research Articles

Related Topics

Articles published on Scheduling Overhead

Gem5-NVDLA: A Simulation Framework for Compiling, Scheduling and Architecture Evaluation on AI System-on-Chips

Low-cost and high-performance channel access strategies for Internet of Nano-Things applications

Efficient end-to-end long-read sequence mapping using minimap2-fpga integrated with hardware accelerated chaining

A novel hierarchical distributed vehicular edge computing framework for supporting intelligent driving

A New Federated Scheduling Algorithm for Arbitrary-Deadline DAG Tasks

Parallel Strong Connectivity Based on Faster Reachability

Point cloud segmentation of overhead contact systems with deep learning in high-speed rails

Soft-core processor integration based on different instruction set architectures and field programmable gate array custom datapath implementation.

An enhanced ordinal optimization with lower scheduling overhead based novel approach for task scheduling in cloud computing environment

Dynamic Virtual Machine Consolidation in the Cloud: A Cuckoo Search Approach

Optimizing Iterative Data-flow Scientific Applications using Directed Cyclic Graphs

Efficient schedulability analysis of hierarchical EDF scheduling with resource sharing

Tenant-Grained Request Scheduling in Software-Defined Cloud Computing

Task Scheduling for Probabilistic In -Band Network Telemetry

Lightweight Monocular Depth Estimation on Edge Devices

EALI: Energy-aware layer-level scheduling for convolutional neural network inference services on GPUs

Designing a Custom CPU Architecture Based on Hardware RTOS and Dynamic Preemptive Scheduler

On the choice of the best chunk size for the speculative execution of loops.

Spatial- and time- division multiplexing in CNN accelerator

Early scheduling on steroids: Boosting parallel state machine replication

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Scheduling Overhead Research Articles

Related Topics

Articles published on Scheduling Overhead

Gem5-NVDLA: A Simulation Framework for Compiling, Scheduling and Architecture Evaluation on AI System-on-Chips

Low-cost and high-performance channel access strategies for Internet of Nano-Things applications

Efficient end-to-end long-read sequence mapping using minimap2-fpga integrated with hardware accelerated chaining

A novel hierarchical distributed vehicular edge computing framework for supporting intelligent driving

A New Federated Scheduling Algorithm for Arbitrary-Deadline DAG Tasks

Parallel Strong Connectivity Based on Faster Reachability

Point cloud segmentation of overhead contact systems with deep learning in high-speed rails

Soft-core processor integration based on different instruction set architectures and field programmable gate array custom datapath implementation.

An enhanced ordinal optimization with lower scheduling overhead based novel approach for task scheduling in cloud computing environment

Dynamic Virtual Machine Consolidation in the Cloud: A Cuckoo Search Approach

Optimizing Iterative Data-flow Scientific Applications using Directed Cyclic Graphs

Efficient schedulability analysis of hierarchical EDF scheduling with resource sharing

Tenant-Grained Request Scheduling in Software-Defined Cloud Computing

Task Scheduling for Probabilistic In -Band Network Telemetry

Lightweight Monocular Depth Estimation on Edge Devices

EALI: Energy-aware layer-level scheduling for convolutional neural network inference services on GPUs

Designing a Custom CPU Architecture Based on Hardware RTOS and Dynamic Preemptive Scheduler

On the choice of the best chunk size for the speculative execution of loops.

Spatial- and time- division multiplexing in CNN accelerator

Early scheduling on steroids: Boosting parallel state machine replication