Mitigating the effect of dataset shift in clustering

Sebastián Maldonado,Ramiro Saltos,Carla Vairetti,José Delpiano

doi:10.1016/j.patcog.2022.109058

Sebastián Maldonado, Ramiro Saltos + Show 2 more

https://doi.org/10.1016/j.patcog.2022.109058

Copy DOI

Abstract

Dataset shift is a relevant topic in unsupervised learning since many applications face evolving environments, causing an important loss of generalization and performance. Most techniques that deal with this issue are designed for data stream clustering, whose goal is to process sequences of data efficiently under Big Data. In this study, we claim dataset shift is an issue for static clustering tasks in which data is collected over a long period. To mitigate it, we propose Time-weighted kernel k-means, a k-means variant that includes a time-dependent weighting process. We do this via the induced ordered weighted average (IOWA) operator. The weighting process acts as a gradual forgetting mechanism, prioritizing recent examples over outdated ones in the clustering algorithm. The computational experiments show the potential Time-weighted kernel k-means has in evolving environments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mitigating the effect of dataset shift in clustering

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Sep 23, 2022
Citations: 3

Similar Papers

Some induced ordered weighted averaging operators and their use for solving group decision-making problems based on fuzzy preference relations
F Chiclana ... S Alonso
European Journal of Operational Research | VOL. 182
F Chiclana, et. al.F Chiclana ... S Alonso
01 Oct 2007
European Journal of Operational Research | VOL. 182

A Combination Forecasting Model Based on IOWA Operator for Dam Safety Monitoring
Yan Bin ... Yu Hai-Bo
-
Yan Bin, et. al. Yan Bin ... Yu Hai-Bo
01 Jan 2013
01 Jan 2013

Combination load forecasting method for CCHP system based on IOWA operator
Yunxin Sun ... Chenghui Zhang
-
Yunxin Sun, et. al.Yunxin Sun ... Chenghui Zhang
01 Oct 2017
01 Oct 2017

Extended IOWA Operator and ITS Application to Group Decision Making with Linguistic Preference Information
Gang Qian ... Ze-Shui Xu
-
Gang Qian, et. al.Gang Qian ... Ze-Shui Xu
01 Aug 2006
01 Aug 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mitigating the effect of dataset shift in clustering

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition