Abstract

The Type-2 fuzzy set (T2 FS) is widely used for efficient control uncertainties, such as noise sensitivity in the fuzzy set. In addition, unsupervised machine learning requires a clustering parameter value in advance, and may affect clustering performance according to prior information such as the number and size of clusters. In this case, the fuzzifier value m to be applied is the most important factor in improving the accuracy of data. Therefore, in this paper, we intend to perform clustering to automatically acquire the determination of m 1 and m 2 values that depended on existing repeated experiments. To this end, in order to increase efficiency on deriving appropriate fuzzifier value, we used the Interval type-2 possibilistic fuzzy C-means (IT2PFCM), clustering method to classify a given pattern. In Efficient IT2PFCM method, used for clustering, we propose an algorithm that derives suitable fuzzifier values for each data. These values also extract information from each data point through the histogram approach and Gaussian Curve Fitting method. Using the extracted information, two adaptive fuzzifier value m 1 and m 2 are determined. Obtained values apply the new lowest and highest membership values. In addition, it is possible to form an appropriate fuzzy area on each cluster by only taking advantage of the characteristics of IT2PFCM, which reduces uncertainty. This doesn't only improve the accuracy of clustering of measured sensor data, but can also be used without additional procedures such as data labeling or the provision of prior information. It is also efficient at monitoring numerous sensors, managing and verifying sensor data collected in real time such as smart cities. Eventually, in this study, the proposed method is to improve IT2PFCM performance on accurate and quick clustering of large amount of complex data such as Internet of Things (IoT).

Highlights

  • Clustering is the process of grouping similar entities together, taking specific predefined features or attributes into consideration

  • The data points are divided into distinct sets, that is, a single data point belongs to only one cluster, whereas in soft clustering, data points have a fuzzy membership in a cluster, that is, a particular data point belongs to more than one cluster, containing different membership value

  • As the needs on developing new method to adaptively determining the fuzzifier value for different kinds of data are growing, this paper proposes a method using a histogram based on the Interval type-2 possibilistic Fuzzy C-means (IT2 PFCM) clustering method

Read more

Summary

INTRODUCTION

Clustering is the process of grouping similar entities together, taking specific predefined features or attributes into consideration. PFCM uses the fuzzifier that is denoted by m, which determines the membership values, and the bandwidth parameter that is used to evaluate the typicality values [5]. The existing research has been conducted to measure the optimum range according to the upper and lower bounds of the fuzzifier value through several repeated experiments [6] These studies are ongoing, the same fuzzy constant range cannot be applied to every data [7]. As the needs on developing new method to adaptively determining the fuzzifier value for different kinds of data are growing, this paper proposes a method using a histogram based on the Interval type-2 possibilistic Fuzzy C-means (IT2 PFCM) clustering method.

BACKGROUND
DETERMINING THE RANGE OF FUZZIFIER VALUE
DETERMINATION OF FUZZIFIER VALUE USING HISTOGRAM
APPLICATION TO SENSOR DATA
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.