Selection Of Centroids Research Articles

The RBF networks belong to a set of artificial neural network architectures. RBF networks have been successfully applied for solving various data mining tasks including classification, and regression. Successful implementation of the RBF network depends on numerous factors among which, the crucial is its structure. The decision on the network structure has to be taken at the network initialization stage. It requires calculating or inducing the number of centroids, and their respective locations. The above problem is known to be NP-hard, and hence, not easily solvable. The traditional approach for deciding on the number of hidden units is based on applying the k-means algorithm for calculating cluster centroids. Unfortunately, the procedure guarantees neither a satisfactory accuracy nor the required generalization level of the RBF network under development. To alleviate the problem for cluster determination, i.e. number of centroids, we propose the similarity-based algorithm (SCA) for the RBF networks initialization, as well as an alternative method for initializing RBFNs using the kernel-based fuzzy clustering algorithm (KFCM-K). In both cases, the number of resulting centroids and their initial locations are provided automatically. The next step involves applying the optimization procedure resulting in the selection of the final centroids’ location. The procedure is integrated with the output weights determination. Since the discussed optimization problem is computationally difficult it has been decided to apply the agent-based population learning algorithm (PLA) which belongs to the class of metaheuristics. A comparative study of approaches based on SCA and KFCM-K is included in the paper. Their effectiveness is demonstrated experimentally using artificial and real benchmark datasets. The results of the computational experiment have shown that both proposed approaches for designing RBFNs perform significantly better than other algorithms used for this task.

Read full abstract

Today, semi-structured and unstructured data are mainly collected and analyzed for data analysis applicable to various systems. Such data have a dense distribution of space and usually contain outliers and noise data. There have been ongoing research studies on clustering algorithms to classify such data (outliers and noise data). The K-means algorithm is one of the most investigated clustering algorithms. Researchers have pointed out a couple of problems such as processing clustering for the number of clusters, K, by an analyst through his or her random choices, producing biased results in data classification through the connection of nodes in dense data, and higher implementation costs and lower accuracy according to the selection models of the initial centroids. Most K-means researchers have pointed out the disadvantage of outliers belonging to external or other clusters instead of the concerned ones when K is big or small. Thus, the present study analyzed problems with the selection of initial centroids in the existing K-means algorithm and investigated a new K-means algorithm of selecting initial centroids. The present study proposed a method of cutting down clustering calculation costs by applying an initial center point approach based on space division and outliers so that no objects would be subordinate to the initial cluster center for dependence lower from the initial cluster center. Since data containing outliers could lead to inappropriate results when they are reflected in the choice of a center point of a cluster, the study proposed an algorithm to minimize the error rates of outliers based on an improved algorithm for space division and distance measurement. The performance experiment results of the proposed algorithm show that it lowered the execution costs by about 13–14% compared with those of previous studies when there was an increase in the volume of clustering data or the number of clusters. It also recorded a lower frequency of outliers, a lower effectiveness index, which assesses performance deterioration with outliers, and a reduction of outliers by about 60%.

Read full abstract

Selection Of Centroids Research Articles

Related Topics

Articles published on Selection Of Centroids

Optimal Text Document Clustering Enabled by Weighed Similarity Oriented Jaya With Grey Wolf Optimization Algorithm

Cross domain-based ontology construction via Jaccard Semantic Similarity with hybrid optimization model

Detection and Correction of Abnormal Data with Optimized Dirty Data: A New Data Cleaning Model

Building a training dataset for classification under a cost limitation

Improved Mean Shift Algorithm for Maximizing Clustering Accuracy

Optimisation of K-means algorithm based on sample density canopy

An Indexing Algorithm Based on Clustering of Minutia Cylinder Codes for Fast Latent Fingerprint Identification

An efficient distance estimation and centroid selection based on k-means clustering for small and large dataset

Designing RBFNs Structure Using Similarity-Based and Kernel-Based Fuzzy C-Means Clustering Algorithms

Pengelompokan Komentar Dataset Sentipol dengan Modified K-Means Clustering

PENINGKATAN KINERJA ALGORITMA K MEANS DENGAN MENGGUNAKAN PARTICLE SWARM OPTIMIZATION DALAM PENGELOMPOKAN DATA PENYEDIAAN AKSES

DISCERN: diversity-based selection of centroids for k-estimation and rapid non-stochastic clustering

An Automatic Centroid Image Selection Method Based on Fuzzy Logic Reasoning in Image Deduplication

Selecting the shape of centroids of round and non-round gears

A Novel Model on Reinforce K-Means Using Location Division Model and Outlier of Initial Value for Lowering Data Cost.

Quality and size assessment of quantized images using K-Means++ clustering

Segmentation of Leaf Spots Disease in Apple Plants Using Particle Swarm Optimization and K-means Algorithm

Knowledge based fuzzy c-means method for rapid brain tissues segmentation of magnetic resonance imaging scans with CUDA enabled GPU machine

A novel Tag Score (T_S) model with improved K-means for clustering tweets

Automatic cloud segmentation from INSAT‐3D satellite image via IKM and IFCM clustering

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Selection Of Centroids Research Articles

Related Topics

Articles published on Selection Of Centroids

Optimal Text Document Clustering Enabled by Weighed Similarity Oriented Jaya With Grey Wolf Optimization Algorithm

Cross domain-based ontology construction via Jaccard Semantic Similarity with hybrid optimization model

Detection and Correction of Abnormal Data with Optimized Dirty Data: A New Data Cleaning Model

Building a training dataset for classification under a cost limitation

Improved Mean Shift Algorithm for Maximizing Clustering Accuracy

Optimisation of K-means algorithm based on sample density canopy

An Indexing Algorithm Based on Clustering of Minutia Cylinder Codes for Fast Latent Fingerprint Identification

An efficient distance estimation and centroid selection based on k-means clustering for small and large dataset

Designing RBFNs Structure Using Similarity-Based and Kernel-Based Fuzzy C-Means Clustering Algorithms

Pengelompokan Komentar Dataset Sentipol dengan Modified K-Means Clustering

PENINGKATAN KINERJA ALGORITMA K MEANS DENGAN MENGGUNAKAN PARTICLE SWARM OPTIMIZATION DALAM PENGELOMPOKAN DATA PENYEDIAAN AKSES

DISCERN: diversity-based selection of centroids for k-estimation and rapid non-stochastic clustering

An Automatic Centroid Image Selection Method Based on Fuzzy Logic Reasoning in Image Deduplication

Selecting the shape of centroids of round and non-round gears

A Novel Model on Reinforce K-Means Using Location Division Model and Outlier of Initial Value for Lowering Data Cost.

Quality and size assessment of quantized images using K-Means++ clustering

Segmentation of Leaf Spots Disease in Apple Plants Using Particle Swarm Optimization and K-means Algorithm

Knowledge based fuzzy c-means method for rapid brain tissues segmentation of magnetic resonance imaging scans with CUDA enabled GPU machine

A novel Tag Score (T_S) model with improved K-means for clustering tweets

Automatic cloud segmentation from INSAT‐3D satellite image via IKM and IFCM clustering