Abstract

Traditional image classification technology has become increasingly unable to meet the changing needs of the big data era. With the open-source release of large labeled databases and the development and spread of high-performance computing, deep learning has moved from theory to practice and is now widely used in image classification. This paper takes big data image classification as its research object, selects distributed deep learning tools based on the Spark cluster platform, and studies image classification algorithms based on distributed deep learning. The Labeled Structural Deep Network Embedding (LSDNE) model, when applied to attribute networks, generates a large number of hyperparameters and has excessive model complexity. To address this, and inspired by the Locally Linear Embedding (LLE) algorithm, this paper proposes a semi-supervised network based on a neighbor structure learning model. When learning the network representation, this model also incorporates each node's neighbor information: through node vector reconstruction, the node itself and its neighbors together form the next layer's representation. Building on Structural Labeled Locally Deep Nonlinear Embedding (SLLDNE), node attributes are further added to propose Structural Informed Locally Distributed Deep Nonlinear Embedding (SILDDNE), and how the model combines the structural characteristics of a node with its attribute characteristics is explained in detail. An SVM classifier classifies the known labels, and SILDDNE fuses the network structure, labels, and node attributes into the deep neural network. Experimental results on the CIFAR-10 and CIFAR-100 image classification benchmarks show that the proposed network achieves good classification performance and high generalization ability.
Experiments on the CIFAR-10 dataset show that the 34-layer SLLDNE, pruned from a 40-layer DenseNet, reduces the parameter count by about 50%, improves computational efficiency by about 8 times, and reduces the classification error rate by 30%. Experiments on the CIFAR-100 dataset show that the 34-layer SLLDNE has a parameter count about 16 times smaller than that of the 19-layer VGG, improves computational efficiency by about 6 times, and reduces the classification error rate by 14%.
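The LLE-inspired reconstruction step described above — the node and its neighbors together forming the next layer's representation — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function name is hypothetical, and uniform neighbor weights are assumed for simplicity (LLE proper would solve a least-squares problem for the reconstruction weights).

```python
import numpy as np

def neighbor_reconstruction(z, adjacency):
    """Combine each node's vector with a weighted reconstruction of its
    neighbors' vectors, in the spirit of Locally Linear Embedding (LLE).

    z         : (n, d) array of node representations at the current layer
    adjacency : (n, n) binary adjacency matrix of the network
    """
    # Row-normalize the adjacency matrix so each node's neighbor weights
    # sum to 1 (uniform weights; LLE would fit them by least squares).
    deg = adjacency.sum(axis=1, keepdims=True)
    weights = adjacency / np.maximum(deg, 1)
    # Weighted sum of neighbor vectors.
    neighbor_part = weights @ z
    # Node and neighbors together form the next layer's input.
    return np.concatenate([z, neighbor_part], axis=1)

# Toy example: 3 nodes in a path graph, 2-dimensional representations.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
Z = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
print(neighbor_reconstruction(Z, A).shape)  # (3, 4): node vector + neighbor part
```

In a deep model such as SLLDNE, a learned nonlinear transformation would follow this concatenation at each layer.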

Highlights

  • Big data is one of the main topics of the current information age, shaping trends in economic and social thinking and in cutting-edge technology research and development in recent years [1]–[3]

  • On the CIFAR-100 dataset, the 34-layer Structural Labeled Locally Deep Nonlinear Embedding (SLLDNE) has a parameter count nearly 16 times smaller than that of the 19-layer VGG, improves computational efficiency by nearly 6 times, and reduces the classification error rate by nearly 14%

  • Two deep learning tools, Caffe and CaffeOnSpark, are introduced, analyzed, and compared, and their advantages and disadvantages summarized, laying the foundation for the distributed deep learning image classification studied in this article


Summary

INTRODUCTION

Big data is one of the main topics of the current information age, shaping trends in economic and social thinking and in cutting-edge technology research and development in recent years [1]–[3]. The CaffeOnSpark API supports DataFrames, making it easy to connect a Spark application to a prepared training dataset and to extract model predictions or intermediate-layer features for MLlib or SQL data analysis; scheduling of deep learning resources can be optimized through YARN. This removes the traditional need for a separate deep learning cluster and the data movement that it entails.

In the proposed model, each subsequent hidden layer is obtained by a weighted reconstruction of the neighbors in the previous layer, and the vector representation of all nodes is z(A).
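The layer-wise neighbor reconstruction can be written in an assumed notation (the paper's exact formulation is not reproduced here) as a recursion over hidden layers, where each node's next representation combines its own vector with a weighted sum of its neighbors' vectors:

```latex
z_i^{(k+1)} = \sigma\!\left( W^{(k)} \left[ z_i^{(k)} \,\middle\|\, \sum_{j \in \mathcal{N}(i)} w_{ij}\, z_j^{(k)} \right] + b^{(k)} \right)
```

Here $\mathcal{N}(i)$ denotes the neighbors of node $i$, $w_{ij}$ are reconstruction weights in the LLE spirit, $\|$ is concatenation, and $W^{(k)}, b^{(k)}, \sigma$ are the layer's weights, bias, and nonlinearity; stacking these layers yields the representation z(A) for all nodes.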


