Abstract

This study investigates the effectiveness of multiple maxout activation function variants on 18 datasets using Convolutional Neural Networks. A network with maxout activations has a higher number of trainable parameters than a network with traditional activation functions. However, it is not clear whether the activation function itself or the increase in the number of trainable parameters is responsible for yielding the best performance on different entity recognition tasks. This paper investigates whether increasing the number of convolutional filters used with traditional activation functions performs as well as or better than maxout networks. Our experiments compare the Rectified Linear Unit, Leaky Rectified Linear Unit, Scaled Exponential Linear Unit, and Hyperbolic Tangent activations to four maxout variants. We observe that maxout networks train more slowly than networks with traditional activation functions, e.g. the Rectified Linear Unit. In addition, we find that, on average across all datasets, the Rectified Linear Unit activation function performs better than any maxout activation when the number of convolutional filters is increased. Furthermore, adding more filters enhances the classification accuracy of Rectified Linear Unit networks without adversely affecting their advantage over maxout activations with respect to network-training speed.
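
A maxout unit takes the elementwise maximum over several parallel affine feature maps, so each maxout filter carries several times the weights of a single ReLU filter. The following is a minimal PyTorch sketch, not the implementation used in this study; the class name MaxoutConv2d, the number of pieces, and the layer sizes are illustrative assumptions. It contrasts a maxout convolutional block with a plain ReLU block and compares their parameter counts.

```python
# Illustrative sketch (assumed sizes), not the authors' code.
import torch
import torch.nn as nn

class MaxoutConv2d(nn.Module):
    """Maxout over `pieces` parallel convolutions: out = max_j conv_j(x)."""
    def __init__(self, in_channels, out_channels, kernel_size, pieces=2):
        super().__init__()
        # One convolution per linear "piece"; weights grow by a factor of `pieces`.
        self.conv = nn.Conv2d(in_channels, out_channels * pieces, kernel_size)
        self.out_channels = out_channels
        self.pieces = pieces

    def forward(self, x):
        y = self.conv(x)                                   # (N, out*pieces, H, W)
        n, _, h, w = y.shape
        y = y.view(n, self.pieces, self.out_channels, h, w)
        return y.max(dim=1).values                         # elementwise max over pieces

relu_block = nn.Sequential(nn.Conv2d(3, 32, 3), nn.ReLU())
maxout_block = MaxoutConv2d(3, 32, 3, pieces=2)

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(relu_block), count(maxout_block))  # maxout holds roughly `pieces` times more weights
```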

Highlights

  • Deep networks have become very useful for many computer vision applications

  • In contrast to the cited papers, we evaluate, with significance testing, whether increasing the number of filters used with the Rectified Linear Unit (ReLU) enhances overall accuracy

  • The results from the image datasets indicate that sextupling the number of convolutional filters in ReLU networks performed better than the rest of the activation functions, but made training more difficult due to the large number of parameters (a parameter-count sketch follows this list)
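
As a back-of-the-envelope illustration (assumed layer sizes, not figures from the paper), the parameter count of a convolutional layer scales linearly with the number of filters, so sextupling the filters sextuples that layer's parameters:

```python
# Illustrative arithmetic with assumed layer sizes, not values from the paper:
# parameters of a conv layer = filters * (in_channels * k * k + 1)  (+1 for the bias)
def conv_params(filters, in_channels, k):
    return filters * (in_channels * k * k + 1)

base = conv_params(32, 3, 3)    # 32 filters  ->   896 parameters
wide = conv_params(192, 3, 3)   # 192 filters -> 5,376 parameters
print(base, wide, wide / base)  # 896 5376 6.0
```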

Introduction

Deep neural networks (DNNs) are models composed of multiple layers that transform input data into outputs while learning increasingly higher-level features. Deep learning relies on learning several levels of hierarchical representations of data. Due to their hierarchical structure, the parameters of a DNN can generally be tuned to approximate target functions more effectively than the parameters of a shallow model [1]. Compared to traditional activation functions, such as logistic sigmoid units or tanh units, which are antisymmetric, ReLU is one-sided. This property encourages the hidden units to be sparse and more biologically plausible [6]. At SemEval-2015 (International Workshop on Semantic Evaluation), Severyn and Moschitti's models ranked first in the phrase-level subtask A and second in the message-level subtask B.
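
A small numerical sketch of the one-sidedness argument (an illustrative assumption, not an experiment from the paper): ReLU maps every negative pre-activation to exactly zero, so roughly half of randomly initialized hidden units are inactive, whereas tanh outputs are antisymmetric around zero and almost never exactly zero.

```python
# Illustrative comparison of ReLU sparsity vs. tanh density (assumed random pre-activations).
import numpy as np

rng = np.random.default_rng(0)
pre_activations = rng.standard_normal(10_000)

relu = np.maximum(0.0, pre_activations)
tanh = np.tanh(pre_activations)

print((relu == 0).mean())  # ~0.5: sparse, one-sided
print((tanh == 0).mean())  # ~0.0: dense, antisymmetric
```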
