Abstract

Recent advances in training deep (multi-layer) architectures have inspired a renaissance in neural network use. For example, deep convolutional networks are becoming the default option for difficult tasks on large datasets, such as image and speech recognition. However, here we show that error rates below 1% on the MNIST handwritten digit benchmark can be replicated with shallow non-convolutional neural networks. This is achieved by training such networks using the ‘Extreme Learning Machine’ (ELM) approach, which also enables a very rapid training time (∼ 10 minutes). Adding distortions, as is common practice for MNIST, reduces error rates even further. Our methods are also shown to be capable of achieving less than 5.5% error rates on the NORB image database. To achieve these results, we introduce several enhancements to the standard ELM algorithm, which individually and in combination can significantly improve performance. The main innovation is to ensure each hidden-unit operates only on a randomly sized and positioned patch of each image. This form of random ‘receptive field’ sampling of the input ensures the input weight matrix is sparse, with about 90% of weights equal to zero. Furthermore, combining our methods with a small number of iterations of a single-batch backpropagation method can significantly reduce the number of hidden-units required to achieve a particular performance. Our close to state-of-the-art results for MNIST and NORB suggest that the ease of use and accuracy of the ELM algorithm for designing a single-hidden-layer neural network classifier should cause it to be given greater consideration either as a standalone method for simpler problems, or as the final classification stage in deep neural networks applied to more difficult problems.
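
A minimal NumPy sketch of this random receptive-field sampling follows. It is an illustration rather than the paper's exact procedure: the function name and the patch-size limits (min_size, max_size) are hypothetical, chosen here so that roughly 90% of the input weights come out zero, as reported above.

    import numpy as np

    def receptive_field_mask(img_side=28, min_size=7, max_size=12, rng=None):
        """Binary mask equal to 1 inside one randomly sized, randomly
        positioned square patch of the image and 0 elsewhere, so that
        multiplying a dense random weight vector by the mask zeroes
        every weight outside the patch."""
        rng = rng if rng is not None else np.random.RandomState(0)
        size = rng.randint(min_size, max_size + 1)   # random patch size
        top = rng.randint(0, img_side - size + 1)    # random patch position
        left = rng.randint(0, img_side - size + 1)
        mask = np.zeros((img_side, img_side))
        mask[top:top + size, left:left + size] = 1.0
        return mask.ravel()                          # flatten to 28*28 values

    # One masked row per hidden unit: dense Gaussian weights times the mask.
    rng = np.random.RandomState(0)
    n_hidden, n_inputs = 1000, 28 * 28
    W = rng.randn(n_hidden, n_inputs)
    masks = np.stack([receptive_field_mask(rng=rng) for _ in range(n_hidden)])
    W_sparse = W * masks
    print("zero fraction:", (W_sparse == 0).mean())  # about 0.9 with these sizes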

Highlights

  • The current renaissance in the field of neural networks is a direct result of the success of various types of deep network in tackling difficult classification and regression problems on large datasets

  • The initial excitement over Convolutional Neural Networks (CNN) and Deep Belief Networks (DBN) methods was triggered by their success on the MNIST handwritten digit recognition problem [1], which was for several years the standard benchmark problem for hard, large dataset machine learning

  • We have found that enhanced classification performance can be achieved by combining the shaped weights obtained by either the CIW Extreme Learning Machine (ELM) or the Constrained ELM (C-ELM) with the receptive field masks provided by the Receptive Field ELM (RF-ELM); a sketch of this combination follows these highlights
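
The combination in the last highlight can be sketched as follows, reusing receptive_field_mask from the sketch under the abstract. The shaped-weight construction shown (normalised difference of two training samples from different classes) follows the general C-ELM idea; the exact construction, normalisation, and scaling used in the paper are assumptions here.

    import numpy as np

    def constrained_weights(X, y, n_hidden, rng):
        """C-ELM-style 'shaped' input weights: each row is the normalised
        difference of two randomly drawn training samples from different
        classes, so the weight vector points across a class boundary."""
        W = np.empty((n_hidden, X.shape[1]))
        for i in range(n_hidden):
            a, b = rng.choice(len(X), size=2, replace=False)
            while y[a] == y[b]:                      # resample until classes differ
                a, b = rng.choice(len(X), size=2, replace=False)
            d = X[a] - X[b]
            W[i] = d / (np.linalg.norm(d) + 1e-12)
        return W

    # Combined variant: shape the weights first, then apply the binary
    # receptive-field masks element-wise, restricting each shaped weight
    # vector to a random patch of the image.
    rng = np.random.RandomState(0)
    X = rng.rand(100, 28 * 28)                       # stand-in for MNIST images
    y = rng.randint(0, 10, size=100)                 # stand-in labels
    W = constrained_weights(X, y, n_hidden=200, rng=rng)
    masks = np.stack([receptive_field_mask(rng=rng) for _ in range(200)])
    W_combined = W * masks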

Summary

Introduction

The current renaissance in the field of neural networks is a direct result of the success of various types of deep network in tackling difficult classification and regression problems on large datasets. A high accuracy on MNIST is regarded as a basic requirement for credibility in a classification algorithm. Both CNN and DBN methods were notable, when first published, for posting the best results up to that time on the MNIST problem. We introduce variations of the Extreme Learning Machine algorithm [10] and report their performance on the MNIST test set. These results are equivalent or superior to the original results achieved by CNN and DBN on this problem, and are achieved with significantly lower network and training complexity. The standard ELM algorithm can provide very good results in machine learning problems requiring classification or regression (function optimization); in this paper we demonstrate that it provides an accuracy on the MNIST problem superior to prior reported results for similarly sized SLFN networks [1, 15].
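
For reference, the standard ELM training step for a single-hidden-layer feedforward network (SLFN) reduces to one linear solve: the random input weights stay fixed and only the output weights are fitted. The sketch below assumes a tanh nonlinearity, one-hot targets T, and a small ridge term; the paper's actual choices of nonlinearity, weight scaling, and regularisation may differ.

    import numpy as np

    def elm_train(X, T, n_hidden=800, ridge=1e-6, rng=None):
        """Standard ELM: fixed random input weights W, hidden activations
        H = tanh(X W^T), output weights B by regularised least squares."""
        rng = rng if rng is not None else np.random.RandomState(0)
        W = rng.randn(n_hidden, X.shape[1]) / np.sqrt(X.shape[1])
        H = np.tanh(X @ W.T)
        # Solve (H^T H + ridge * I) B = H^T T for the output weights B.
        B = np.linalg.solve(H.T @ H + ridge * np.eye(n_hidden), H.T @ T)
        return W, B

    def elm_predict(X, W, B):
        """Predicted class = index of the largest network output."""
        return np.argmax(np.tanh(X @ W.T) @ B, axis=1)

Because training is a single regularised least-squares solve rather than iterative gradient descent, the short training times quoted in the abstract are plausible even for large hidden layers.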

Method
Results and discussion for the MNIST benchmark
Results on NORB
Conclusions
