Abstract
Embedding neural network (NN) models in the data plane is a promising way to leverage the computational power of network switches, made possible by the advent of the P4 language for programming the data plane. However, most data planes today impose constraints such as a limited set of supported operations and limited memory. The computational cost of training large-scale NNs is also high. Moreover, while complex large-scale NN architectures are often used to improve prediction accuracy, they degrade the functional performance of the data plane owing to factors such as the large number of input parameters and the complexity of the model design. Reducing the performance cost of implementing large NN models in the data plane is therefore a critical issue. This research proposes a technique called Neural Network Split (NNSplit) that addresses the performance problems of embedding a large NN in a data plane by splitting the NN layers across multiple data planes. To support layer splitting, a new protocol called SuppORting ComplEx Computation in the Network (SØREN) is also proposed. The SØREN protocol header carries the activation values and bridges the NN layers across the switches. Multi-class classification of network traffic is used as the context for the experimental analysis. Experimental results show that, compared with non-split NN architectures, NNSplit reduces memory usage by nearly 50% and increases network traffic throughput at the cost of a 14% increase in round-trip time. In addition, when the SØREN protocol is encapsulated into data packets, the average switch processing time is 773 µs, which has very little impact on packet processing time. The results show that the proposed NNSplit–SØREN can support large NN models on the data plane at a small performance cost.
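The core idea described above, each switch hosting a subset of the NN layers and a SØREN-like header field carrying the intermediate activations to the next hop, can be sketched in simulation. The sketch below is a hypothetical illustration, not the paper's P4 implementation: the class names, the `soren_activations` field, the layer sizes, and the random weights are all illustrative assumptions.

```python
import random

random.seed(0)

def dense(x, W, b, relu):
    """Apply one fully connected layer; ReLU only between hidden layers."""
    y = [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]
    return [max(v, 0.0) for v in y] if relu else y

class Switch:
    """One data-plane hop hosting a single NN layer (illustrative)."""
    def __init__(self, W, b, last=False):
        self.W, self.b, self.last = W, b, last

    def process(self, packet):
        # Read the activations from the SØREN-like header field,
        # apply this hop's layer, and write the result back.
        packet["soren_activations"] = dense(
            packet["soren_activations"], self.W, self.b, relu=not self.last)
        return packet

def rand_matrix(rows, cols):
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

# Split a small 4-8-3 traffic classifier across two switches.
sw1 = Switch(rand_matrix(8, 4), [0.0] * 8)
sw2 = Switch(rand_matrix(3, 8), [0.0] * 3, last=True)

# A packet whose header carries the extracted flow features as initial activations.
packet = {"soren_activations": [0.2, 1.0, 0.5, 0.1]}
for hop in (sw1, sw2):
    packet = hop.process(packet)

logits = packet["soren_activations"]
predicted_class = logits.index(max(logits))  # one of 3 traffic classes
print(predicted_class)
```

In a real P4 target the arithmetic would typically use fixed-point integers and match-action tables rather than floating point, but the data flow, activations written into a custom header and consumed by the next switch's layer, is the same.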