Abstract
This paper explores a way to reduce the rate of re-clustering and speed up the clustering of categorical time-evolving data. The method introduces two algorithms, RDE (Replicated Data Elimination) and RCRDE. The RDE algorithm removes successive occurrences of replicated data and uses counters to keep track of the removed data. This limits the number of windows created by the sliding window technique, which in turn reduces the number of times the clustering algorithm is run. The RCRDE algorithm, which is based on the MARDL (MAximal Resemblance Data Labeling) framework, decides whether to re-cluster or to modify the previous clustering results. The method is independent of the type of clustering algorithm, so any categorical clustering algorithm can be used. According to results obtained on different data sets, the method performs well in practice and facilitates clustering of categorical data. It can also be used to cluster a very large static categorical database with higher quality than previous work.
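The idea behind RDE can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes that "replicated data" means runs of identical consecutive records, and that the sliding window technique cuts the stream into fixed-size consecutive windows. The names `eliminate_replicated` and `windows` are hypothetical and chosen only for this illustration.

```python
from itertools import groupby
from typing import Hashable, Iterable, Iterator, List, Tuple

Record = Tuple[Hashable, ...]  # one categorical data point (assumed representation)


def eliminate_replicated(stream: Iterable[Record]) -> List[Tuple[Record, int]]:
    """Collapse each run of identical consecutive records into (record, counter)."""
    return [(record, sum(1 for _ in run)) for record, run in groupby(stream)]


def windows(items: List, size: int) -> Iterator[List]:
    """Cut a list into consecutive fixed-size windows (the last may be shorter)."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


stream = [
    ("a", "x"), ("a", "x"), ("a", "x"),
    ("b", "y"), ("b", "y"),
    ("a", "x"),
]

collapsed = eliminate_replicated(stream)

# Fewer items after elimination means fewer windows, and so fewer clustering runs.
raw_window_count = len(list(windows(stream, 2)))           # 3 windows
collapsed_window_count = len(list(windows(collapsed, 2)))  # 2 windows
print(raw_window_count, collapsed_window_count)
```

The counters preserve how often each record occurred, so a clustering algorithm that accepts weighted points could still account for the removed duplicates; whether RDE uses the counters in this way is not stated in the abstract.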