Abstract

Training deep neural networks (DNNs) on high-dimensional data with no spatial structure poses a major computational problem. It implies a network architecture with a huge input layer, which greatly increases the number of weights, often making the training infeasible. One solution to this problem is to reduce the dimensionality of the input space to a manageable size, and then train a deep network on a representation with fewer dimensions. Here, we focus on performing the dimensionality reduction step by randomly projecting the input data into a lower-dimensional space. Conceptually, this is equivalent to adding a random projection (RP) layer in front of the network. We study two variants of RP layers: one where the weights are fixed, and one where they are fine-tuned during network training. We evaluate the performance of DNNs with input layers constructed using several recently proposed RP schemes. These include: Gaussian, Achlioptas’, Li’s, subsampled randomized Hadamard transform (SRHT) and Count Sketch-based constructions. Our results demonstrate that DNNs with an RP layer achieve competitive performance on high-dimensional real-world datasets. In particular, we show that SRHT and Count Sketch-based projections provide the best balance between the projection time and the network performance.
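
For concreteness, the dense (Gaussian) and sparse (Achlioptas', Li's) projection matrices named above can be sketched as follows. This is a minimal numpy illustration using one common scaling convention; the function names and dimensions are our own choices for illustration, not code from the paper, and the SRHT and Count Sketch constructions are not shown.

    import numpy as np

    rng = np.random.default_rng(0)

    def gaussian_rp(d, k):
        # Dense Gaussian projection: entries drawn from N(0, 1/k), so that
        # E[||x R||^2] = ||x||^2 for a row vector x (a common convention).
        return rng.normal(0.0, 1.0 / np.sqrt(k), size=(d, k))

    def sparse_rp(d, k, s=3):
        # Achlioptas' construction for s = 3; Li's "very sparse" variant
        # uses larger s (e.g. s = sqrt(d)). Entries are +-sqrt(s/k) with
        # probability 1/(2s) each, and 0 otherwise.
        signs = rng.choice([1.0, 0.0, -1.0], size=(d, k),
                           p=[1 / (2 * s), 1 - 1 / s, 1 / (2 * s)])
        return np.sqrt(s / k) * signs

    # Project n = 100 samples from d = 10000 down to k = 256 dimensions;
    # the projected data X_low is what the DNN is then trained on.
    X = rng.normal(size=(100, 10_000))
    X_low = X @ sparse_rp(10_000, 256, s=int(np.sqrt(10_000)))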

Highlights

  • Deep-learning methods excel in many classical machine learning tasks, such as image and speech recognition or sequence modelling [1]

  • We study two ways of training this architecture: one where the parameters of the random projection (RP) layer are fixed during training, and one where they are fine-tuned with error backpropagation

  • We studied the viability of training deep neural networks with a random projection layer


Summary

Introduction

Deep-learning methods excel in many classical machine learning tasks, such as image and speech recognition or sequence modelling [1]. The motivation for this work stems from the problem of training DNNs on unstructured data with a large number of dimensions. When there is no exploitable input structure, training DNNs on high-dimensional data poses a significant computational problem. The reason for this is the implied network architecture, and in particular an input layer which may contain billions of weights. Even with recent advances in GPGPU computing, training networks with this number of parameters is infeasible. Learning in such applications is often performed with linear classifiers, usually support vector machines or logistic regression [3]. We show that this problem can be solved by incorporating random projection into the network architecture.
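
To make this architecture concrete, the sketch below shows a network whose first layer is a random projection, in the fixed-weight and fine-tuned variants studied in the paper. It is a minimal PyTorch-style sketch: the Gaussian projection and the layer sizes are assumptions made for illustration, not the authors' implementation, which also covers sparse, SRHT and Count Sketch projections.

    import torch
    import torch.nn as nn

    class RPNet(nn.Module):
        # A DNN whose input layer is a random projection (RP). With
        # fine_tune=False the RP weights stay fixed (a pure dimensionality
        # reduction step); with fine_tune=True they are updated by error
        # backpropagation like any other layer.
        def __init__(self, d_in, d_rp, d_hidden, n_classes, fine_tune=False):
            super().__init__()
            R = torch.randn(d_in, d_rp) / d_rp ** 0.5   # Gaussian RP matrix
            self.rp = nn.Parameter(R, requires_grad=fine_tune)
            self.mlp = nn.Sequential(
                nn.Linear(d_rp, d_hidden), nn.ReLU(),
                nn.Linear(d_hidden, n_classes),
            )

        def forward(self, x):              # x: (batch, d_in)
            return self.mlp(x @ self.rp)   # project, then classify

    # Example: 50000-dimensional inputs reduced to 512 dimensions before the MLP.
    net = RPNet(d_in=50_000, d_rp=512, d_hidden=256, n_classes=2, fine_tune=False)
    logits = net(torch.randn(8, 50_000))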

Random projection matrices
Neural networks with random projection layer
Fixed‐weight random projection layer
Fine‐tuned random projection layer
Experiments on synthetic datasets
Experiments on real‐world datasets
Related work
Findings
Conclusions
