Shared Subscribe Hyper Simulation Optimization (SUBHSO) Algorithm for Clustering Big Data – Using Big Databases of Iran Electricity Market

Mesbaholdin Salami, Farzad Movahedi Sobhani, Mohammad Sadegh Ghazizadeh

doi:10.2478/acss-2019-0007


Open Access

https://doi.org/10.2478/acss-2019-0007


Abstract

Many real-world problems involve big data with many recorded fields and/or attributes. In such cases, data mining requires dimension reduction techniques, because conventional clustering methods face serious challenges when dealing with big data. Subspace selection is one of the most important dimension reduction techniques: a selected set of subspaces is substituted for the full dataset of the problem, and clustering is performed on this set. This article introduces the Shared Subscribe Hyper Simulation Optimization (SUBHSO) algorithm, which assigns optimized cluster centres to a set of subspaces. SUBHSO uses an optimization loop that modifies and optimizes the coordinates of the cluster centres with particle swarm optimization (PSO) and calculates the fitness function using Monte Carlo simulation. A case study on big data from the Iran electricity market (IEM) shows an improvement in the defined fitness function, which represents cluster cohesion and separation, relative to other dimension reduction algorithms.
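The optimization loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`mc_fitness`, `pso_cluster`), the PSO coefficients, and the particular fitness (sampled cohesion divided by the smallest distance between centres, lower is better) are assumptions chosen only to show how PSO can move cluster centres inside a chosen subspace while a Monte Carlo sample estimates the fitness.

```python
import math
import random


def mc_fitness(centres, data, dims, n_samples, rng):
    """Monte Carlo estimate of cohesion / separation (lower is better).

    Hypothetical fitness: mean distance from randomly sampled points to
    their nearest centre, divided by the smallest centre-to-centre distance.
    Requires at least two centres.
    """
    sample = [rng.choice(data) for _ in range(n_samples)]

    def dist(point, centre):
        # Distance measured only on the selected subspace dimensions.
        return math.sqrt(sum((point[d] - centre[i]) ** 2
                             for i, d in enumerate(dims)))

    cohesion = sum(min(dist(p, c) for c in centres) for p in sample) / n_samples
    separation = min(math.dist(a, b)
                     for i, a in enumerate(centres)
                     for b in centres[i + 1:])
    return cohesion / (separation + 1e-9)


def pso_cluster(data, dims, k, n_particles=20, n_iter=60, n_samples=100, seed=0):
    """Search for k cluster centres in the subspace `dims` with a basic PSO."""
    rng = random.Random(seed)
    m = len(dims)
    size = k * m  # each particle is k centres flattened into one vector
    lo = [min(p[d] for p in data) for d in dims]
    hi = [max(p[d] for p in data) for d in dims]

    def unpack(x):
        return [x[j * m:(j + 1) * m] for j in range(k)]

    # Start every particle from k randomly chosen data points.
    pos = [[pt[d] for pt in rng.sample(data, k) for d in dims]
           for _ in range(n_particles)]
    vel = [[0.0] * size for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_f = [mc_fitness(unpack(p), data, dims, n_samples, rng) for p in pos]
    g = min(range(n_particles), key=pbest_f.__getitem__)
    gbest, gbest_f = pbest[g][:], pbest_f[g]

    w, c1, c2 = 0.7, 1.5, 1.5  # assumed inertia and acceleration coefficients
    for _ in range(n_iter):
        for i in range(n_particles):
            for j in range(size):
                r1, r2 = rng.random(), rng.random()
                vel[i][j] = (w * vel[i][j]
                             + c1 * r1 * (pbest[i][j] - pos[i][j])
                             + c2 * r2 * (gbest[j] - pos[i][j]))
                # Keep each coordinate inside the data's bounding box.
                pos[i][j] = min(max(pos[i][j] + vel[i][j], lo[j % m]), hi[j % m])
            f = mc_fitness(unpack(pos[i]), data, dims, n_samples, rng)
            if f < pbest_f[i]:
                pbest[i], pbest_f[i] = pos[i][:], f
                if f < gbest_f:
                    gbest, gbest_f = pos[i][:], f
    return unpack(gbest), gbest_f
```

Calling `pso_cluster(data, dims=(0, 2), k=2)` on a list of equal-length numeric tuples returns the best centres found (in subspace coordinates) and their estimated fitness. Because the fitness is a sampled estimate, it is noisy; a larger `n_samples` trades run time for a more stable comparison between particles.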

Highlights

Conventional data mining methods are not suitable for big data analysis, since they struggle to compute the variety of distance measurements required in a reasonable time.
Correlation-based clustering methods are applied to a set of non-correlated dimensions, with clusters created in a new space or its subspaces.
The focus is on calculating subspaces of the data, which avoids computational complications without affecting clustering accuracy.

Summary

INTRODUCTION

Conventional data mining methods are not suitable for big data analysis, since they struggle to compute the variety of distance measurements required in a reasonable time. Another goal is to find a set of attributes that clearly reflects the similarity of data in a dataset. To this end, many subspace clustering methods have been developed to solve the problem of data analysis in the full data space. In all of these problems, the aim is to select a set of subspaces as an appropriate substitute for the whole dataset. This category of methods is more accurate than other methods, but with a large amount of data its precision is reduced because the subspaces can no longer represent the entire data well. This article proposes a hybrid clustering algorithm, namely Shared Subscribe Hyper Simulation Optimization (SUBHSO), to overcome this limitation of subspace methods in the analysis of large data. The rest of this paper covers the model components (Section 2), the Iran electricity market and big data (Section 3), data clustering with the proposed algorithm (Section 4), a comparison of the proposed algorithm with its predecessors in terms of execution (Section 5), and validation of the proposed algorithm (Section 6).
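As a simple illustration of what selecting a subspace in place of the whole dataset can mean, the sketch below greedily keeps only dimensions that are weakly correlated with those already kept. This is a generic filter, not the selection procedure used by SUBHSO; the function names and the 0.9 threshold are assumptions made for the example.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def select_subspace(rows, threshold=0.9):
    """Return indices of dimensions to keep.

    A dimension is skipped if it is constant, or if its absolute correlation
    with any already-kept dimension reaches the (assumed) threshold.
    """
    columns = list(zip(*rows))
    kept = []
    for j, col in enumerate(columns):
        if len(set(col)) == 1:
            continue  # constant column carries no clustering information
        if all(abs(pearson(col, columns[i])) < threshold for i in kept):
            kept.append(j)
    return kept


rows = [(1, 2, 5, 7), (2, 4, 1, 7), (3, 6, 4, 7), (4, 8, 2, 7)]
print(select_subspace(rows))  # prints [0, 2]
```

Here column 1 is an exact multiple of column 0 and column 3 is constant, so only columns 0 and 2 are retained. Clustering would then run on this reduced set of dimensions instead of the full data space.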

MODEL COMPONENTS

SUBHSO Algorithm

IRAN ELECTRICITY MARKET AND BIG DATA

DATA CLUSTERING WITH THE PROPOSED ALGORITHM

COMPARISON OF THE PROPOSED ALGORITHM WITH PREDECESSORS IN TERMS OF EXECUTION

VALIDATION OF THE PROPOSED ALGORITHM

CONCLUSION AND RECOMMENDATIONS


Journal: Applied Computer Systems
Publication Date: May 1, 2019
Citations: 1
License type: CC BY 4.0


