Evolving clustering algorithm based on mixture of typicalities for stream data mining

José Maia,Carlos Alberto Severiano,Frederico Gadelha Guimarães,Cristiano Leite De Castro,André Paim Lemos,Juan Camilo Fonseca Galindo,Miri Weiss Cohen

doi:10.1016/j.future.2020.01.017

Abstract

Many applications have been producing streaming data nowadays, which motivates techniques to extract knowledge from such sources. In this sense, the development of data stream clustering algorithms has gained an increasing interest. However, the application of these algorithms in real systems remains a challenge, since data streams often come from non-stationary environments, which can affect the choice of a proper set of model parameters for fitting the data or finding a correct number of clusters. This work proposes an evolving clustering algorithm based on a mixture of typicalities. It is based on the TEDA framework and divide the clustering problem into two subproblems: micro-clusters and macro-clusters. Experimental results with benchmarking data sets showed that the proposed methodology can provide good results for clustering data and estimating its density even in the presence of events that can affect data distribution parameters, such as concept drifts. In addition, the model parameters were robust in relation to the state-of-the-art algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evolving clustering algorithm based on mixture of typicalities for stream data mining

Abstract

Talk to us

Similar Papers

More From: Future Generation Computer Systems

Lead the way for us

Journal: Future Generation Computer Systems	Publication Date: Jan 24, 2020
Citations: 29

Similar Papers

Online embedding and clustering of evolving data streams
Alaettin Zubaroğlu ... Volkan Atalay
Statistical Analysis and Data Mining: The ASA Data Science Journal | VOL. 16
Alaettin Zubaroğlu, et. al.Alaettin Zubaroğlu ... Volkan Atalay
06 Jul 2022
Statistical Analysis and Data Mining: The ASA Data Science Journal | VOL. 16

Multi-Source Transfer Learning for Non-Stationary Environments
Honghui Du ... Leandro L Minku
-
Honghui Du, et. al.Honghui Du ... Leandro L Minku
01 Jul 2019
01 Jul 2019

Online Embedding and Clustering of Data Streams
Alaettin Zubaroğlu ... Volkan Atalay
-
Alaettin Zubaroğlu, et. al.Alaettin Zubaroğlu ... Volkan Atalay
20 Nov 2019
20 Nov 2019

Expressive and modular rule-basedclassifier for data streams

-

31 Jul 2019
31 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evolving clustering algorithm based on mixture of typicalities for stream data mining

Abstract

Talk to us

Similar Papers

More From: Future Generation Computer Systems