Abstract

In this research paper, an incremental clustering approach-enabled MapReduce framework is implemented that include two phases, mapper and reducer phase. In the mapper phase, there are two processes, pre-processing and feature extraction. Once the input data is pre-processed, the feature extraction is done using wordnet features. Then, the features are fed to the reducer phase, where the features are selected using entropy function. Then, the automatic incremental clustering is done using bat-grey wolf optimizer (BAGWO). BAGWO is the integration of bat algorithm (BA) into grey wolf optimization (GWO) for generating various clusters of text documents. Upon the arrival of the incremental data, the mapping of the new data with respect to the centroids is done to obtain the effective cluster. For mapping, kernel-based deep point distance and for centroid update, fuzzy concept is used. The performance of the proposed framework outperformed the existing techniques using rand coefficient, Jaccard coefficient, and clustering accuracy with maximal values 0.921, 0.920, and 0.95, respectively.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.