Abstract

In the optimization of deep neural networks (DNNs) via evolutionary algorithms (EAs), and in the implementation of the training required to build the objective function, there is often a trade-off between efficiency and flexibility. Pure software solutions implemented on general-purpose processors tend to be slow because they do not take full advantage of the available parallelism, whereas hardware realizations based on heterogeneous platforms (combining central processing units (CPUs), graphics processing units (GPUs) and/or field-programmable gate arrays (FPGAs)) are developed with different methodologies, supported by different languages, and built under very different implementation criteria. This paper first presents a study that demonstrates the need for a heterogeneous (CPU-GPU-FPGA) platform to accelerate the optimization of artificial neural networks (ANNs) using genetic algorithms. Second, the paper presents implementations of the calculations related to the individuals evaluated in such an algorithm on different (CPU- and FPGA-based) platforms, using the same source files written in OpenCL. The implementation of individuals on remote, low-cost FPGA systems on a chip (SoCs) is found to provide good efficiency in terms of performance per watt.
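For CPU and FPGA devices to share the same source files, the evaluation of each individual can be expressed as ordinary OpenCL kernels. Below is a minimal, hypothetical sketch of such a kernel (a fully connected layer with a sigmoid activation); the kernel and argument names are illustrative and are not taken from the paper.

    /* Forward pass of one fully connected layer; the same OpenCL C source can be
       compiled for a CPU device or, via the vendor's offline compiler, for an FPGA SoC. */
    __kernel void fc_layer_forward(__global const float *weights, /* n_out x n_in, row-major */
                                   __global const float *inputs,  /* n_in  */
                                   __global const float *bias,    /* n_out */
                                   __global float *outputs,       /* n_out */
                                   const int n_in)
    {
        int neuron = get_global_id(0);               /* one work-item per output neuron */
        float acc = bias[neuron];
        for (int i = 0; i < n_in; ++i)
            acc += weights[neuron * n_in + i] * inputs[i];
        outputs[neuron] = 1.0f / (1.0f + exp(-acc)); /* sigmoid activation */
    }

On a CPU or GPU the kernel is executed by many parallel work-items, whereas an FPGA toolchain typically pipelines the inner loop; the source is identical, but the resulting implementations differ greatly, which is the portability the abstract refers to.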

Highlights

  • Artificial neural networks (ANNs) are widely used in many areas of research and have produced very promising results. The topological design of an ANN determines its usefulness, because it significantly influences the network’s performance [1]

  • Uncertainty about whether a given topology is optimal may be deemed acceptable by researchers if the other ANN conditions are fixed; when all of these parameters must be evaluated as part of the same research, a long period of experimentation is required to determine the optimal topology

  • Our method consists of the following phases: Phase 1: Selection of the best inputs via evolutionary computation based on the delta test
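The delta test mentioned in the last highlight is a nonparametric estimate of the noise variance: each target value is compared with the target of its nearest neighbour in the subspace spanned by the currently selected inputs. A minimal host-side sketch in C is given below; the function and variable names are hypothetical and only illustrate the computation used as a fitness measure for input selection.

    #include <float.h>
    #include <stddef.h>

    /* delta = 1/(2N) * sum_i (y_i - y_NN(i))^2, where NN(i) is the nearest
       neighbour of sample i using only the inputs flagged in 'selected'. */
    double delta_test(const double *x, const double *y,
                      const int *selected, size_t n, size_t d)
    {
        double acc = 0.0;
        for (size_t i = 0; i < n; ++i) {
            double best = DBL_MAX;
            size_t nn = i;
            for (size_t j = 0; j < n; ++j) {
                if (j == i) continue;
                double dist = 0.0;
                for (size_t k = 0; k < d; ++k) {
                    if (!selected[k]) continue;   /* use only the selected inputs */
                    double diff = x[i * d + k] - x[j * d + k];
                    dist += diff * diff;
                }
                if (dist < best) { best = dist; nn = j; }
            }
            double dy = y[i] - y[nn];
            acc += dy * dy;
        }
        return acc / (2.0 * n);
    }

A lower value indicates that the selected inputs explain the target better, so the evolutionary search favours input subsets with a small delta-test value.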


Summary

Introduction

Artificial neural networks (ANNs) are widely used in many areas of research and have produced very promising results. When good results are achieved, it is not clear whether those results are optimal among all network structures. This uncertainty may be deemed acceptable by researchers if the other ANN conditions (for example, the numbers of training samples and input variables) are fixed; when all of these parameters must be evaluated as part of the same research, a long period of experimentation is required to determine the optimal topology. Structures with the same representation (genotype) may show quite different levels of fitness, because the trained weights depend on their random initial values. This one-to-many mapping from the genotypes to the actual networks (phenotypes) may induce noisy fitness evaluations and result in misleading evolution. To reduce such noise, an architecture should usually be trained many times using different random initial weights.
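A common way to mitigate this noise, consistent with the last sentence above, is to evaluate each topology as the average fitness over several trainings started from different random weights. The sketch below only illustrates that idea; Topology and train_and_evaluate() are hypothetical placeholders, not the paper's interface.

    typedef struct Topology Topology;                     /* opaque network description   */
    extern double train_and_evaluate(const Topology *t,   /* returns e.g. validation MSE  */
                                     unsigned seed);      /* seed for the initial weights */

    /* Average the fitness of one topology over several random initializations. */
    double averaged_fitness(const Topology *topo, int repetitions)
    {
        double sum = 0.0;
        for (int r = 0; r < repetitions; ++r)
            sum += train_and_evaluate(topo, (unsigned)r); /* different initial weights */
        return sum / repetitions;
    }

Averaging trades extra training time for a less noisy fitness landscape, which is precisely why accelerating the training step on heterogeneous hardware matters.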

Evolutionary Optimization Method
Optimization of the Topology
Control of Random Number Generation
Weight Initialization
Heterogeneous Computational Platform
Implementation of the Individuals
Calculation of Weight Changes (Tcw)
Update of Weights (Tuw)
Version 1
Version 2
Version 3
Preliminary Comments
Analysis of Hardware Results
Analysis of Performance Results
Preliminary Comments
Performance Efficiency
Conclusions
