Abstract

In recent years, advances in deep learning (DL) technology have greatly improved artificial intelligence (AI)-related research and services. Among these, real-time object recognition using network cameras has become an important technology for various applications. A large number of network cameras are being deployed for real-time object detection using DL models at GPU-based edge servers. A significant challenge in widely deploying this type of system is achieving low-cost network deployment and low-latency data transmission. A promising option for efficiently accommodating numerous network cameras is the time- and wavelength-division multiplexed passive optical network (TWDM-PON), which has become prevalent in optical access network systems. The key challenge in a GPU-based inference system via TWDM-PON is to optimally allocate upstream wavelengths and bandwidths to enable real-time inference. To address this problem, this article proposes the concept of an inference system in which many cameras upload image data to a GPU-based edge server via TWDM-PON. A real-time resource allocation scheme for TWDM-PON is also proposed to guarantee low latency and time-synchronized data arrival at the edge. We formulate the wavelength and bandwidth allocation problem as a Boolean satisfiability (SAT) problem for fast computation. The performance of the proposed method is verified by computer simulation. The proposed scheme increases the batch size of data arriving at the edge server while ensuring low-latency transmission, which in turn greatly improves the computational efficiency of the GPU-based inference server.
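
The article itself presents the full SAT formulation; purely as an illustration of the general idea, the toy sketch below encodes a simplified wavelength-assignment problem as CNF clauses and checks satisfiability by brute force. All parameters (camera count, wavelength count, per-wavelength capacity) and the encoding choices are assumptions for this sketch, not values or constraints taken from the article.

```python
# Toy sketch (not the authors' formulation): encode a simplified
# wavelength-assignment problem as CNF and check it by brute force.
from itertools import combinations, product

NUM_CAMERAS = 4        # assumed
NUM_WAVELENGTHS = 2    # assumed
CAPACITY = 2           # assumed: max cameras sharing one upstream wavelength

# Boolean variable x[c][w] is True iff camera c transmits on wavelength w.
# Variables are numbered 1..NUM_CAMERAS*NUM_WAVELENGTHS for CNF literals.
def var(c, w):
    return c * NUM_WAVELENGTHS + w + 1

clauses = []

# Each camera is assigned at least one wavelength.
for c in range(NUM_CAMERAS):
    clauses.append([var(c, w) for w in range(NUM_WAVELENGTHS)])

# Each camera is assigned at most one wavelength.
for c in range(NUM_CAMERAS):
    for w1, w2 in combinations(range(NUM_WAVELENGTHS), 2):
        clauses.append([-var(c, w1), -var(c, w2)])

# At most CAPACITY cameras per wavelength: forbid every (CAPACITY+1)-subset.
for w in range(NUM_WAVELENGTHS):
    for subset in combinations(range(NUM_CAMERAS), CAPACITY + 1):
        clauses.append([-var(c, w) for c in subset])

def solve(clauses, num_vars):
    """Brute-force SAT check: try every truth assignment (toy sizes only)."""
    for bits in product([False, True], repeat=num_vars):
        assignment = {i + 1: bits[i] for i in range(num_vars)}
        if all(any(assignment[l] if l > 0 else not assignment[-l] for l in cl)
               for cl in clauses):
            return assignment
    return None

model = solve(clauses, NUM_CAMERAS * NUM_WAVELENGTHS)
if model:
    for c in range(NUM_CAMERAS):
        for w in range(NUM_WAVELENGTHS):
            if model[var(c, w)]:
                print(f"camera {c} -> wavelength {w}")
else:
    print("no feasible allocation")
```

In practice, a formulation like this would be handed to an off-the-shelf SAT solver rather than brute force, and would include the timing and bandwidth constraints needed for synchronized, low-latency arrival; those details are left to the full paper.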
