Abstract

Density Based Spatial Clustering of Application with Noise (DBSCAN) is one of the mostly preferred algorithm among density based clustering approaches in unsupervised machine learning, which uses epsilon neighborhood construction strategy in order to discover arbitrary shaped clusters. DBSCAN separates dense regions from low density regions and simultaneously assigns points that lie alone as outliers to unearth the hidden cluster patterns in the datasets. DBSCAN identifies dense regions by means of core point definition, detection of which are strictly dependent on input parameter definitions: ε is distance of the neighborhood or radius of hypersphere and MinPts is minimum density constraint inside ε radius hypersphere. Contrarily to classical DBSCAN’s crisp core point definition, intuitionistic fuzzy core point definition is proposed in our preliminary work to make DBSCAN algorithm capable of detecting different patterns of density by two different combinations of input parameters, particularly is a necessity for the density varying large datasets in multidimensional feature space. In this study, preliminarily proposed DBSCAN extension is studied: IFDBSCAN. The proposed extension is tested by computational experiments on several machine learning repository real-time datasets. Results show that, IFDBSCAN is superior to classical DBSCAN with respect to external & internal performance indices such as purity index, adjusted rand index, Fowlkes-Mallows score, silhouette coefficient, Calinski-Harabasz index and with respect to clustering structure results without increasing computational time so much, along with the possibility of trying two different density patterns on the same run and trying intermediary density values for the users by manipulating α margin.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call