Abstract

Density-based clustering is a prominent and an essential technique in mining data streams. It can discover clusters of arbitrary or irregular shape and handle noisy data. This paper presents StreamSW, a new density-based approach for clustering streaming data over a sliding window (SW). A Sliding window is an extensively adopted window model for capturing and mining streaming data. The StreamSW approach adopts a two-phase framework for performing clustering on streaming data. In the online phase, the p-micro-clusters and grid structure are adopted to hold a synopsis of the streaming data. In the offline phase, final macro-clusters are created from the p-micro-clusters of online phase using the conventional DBSCAN algorithm. The StreamSW approach discovers the clusters with an irregular shape in limited memory and time. Experimental results exhibit that the speed and quality of StreamSW are better than the current approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.