Abstract

The fuzzy C-Means algorithm is a partition-based clustering algorithm.Fuzzy C-Means is very helpful in modeling data whose distribution has outliers. Outliers are where there is a data object that is far apart from the existing clusters. Fuzzy C-Means groups data by minimizing the membership function of a data set. so that each piece of data can be a member of more than one group. In this study, the dataset used was the paper citation vs. H-index dataset in the Kaggle.com repository. This dataset is known to have outliers in fuzzy C-Means and has better performance compared to the K-Means and K-Medoid algorithms in modeling datasets that have outliers.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.