Detecting virtual concept drift of regressors without ground truth values

Emilia Oikarinen,Henri Tiittanen,Kai Puolamäki,Andreas Henelius

doi:10.1007/s10618-021-00739-7

Emilia Oikarinen, Henri Tiittanen + Show 2 more

Open Access

https://doi.org/10.1007/s10618-021-00739-7

Copy DOI

Abstract

Regression analysis is a standard supervised machine learning method used to model an outcome variable in terms of a set of predictor variables. In most real-world applications the true value of the outcome variable we want to predict is unknown outside the training data, i.e., the ground truth is unknown. Phenomena such as overfitting and concept drift make it difficult to directly observe when the estimate from a model potentially is wrong. In this paper we present an efficient framework for estimating the generalization error of regression functions, applicable to any family of regression functions when the ground truth is unknown. We present a theoretical derivation of the framework and empirically evaluate its strengths and limitations. We find that it performs robustly and is useful for detecting concept drift in datasets in several real-world domains.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data Mining and Knowledge Discovery	Publication Date: Feb 4, 2021
Citations: 15	License type: open-access

R Discovery Prime

R Discovery Prime

Detecting virtual concept drift of regressors without ground truth values

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery

Lead the way for us

Similar Papers

Learning under Concept Drift: A Review
Jie Lu ... Feng Gu
IEEE Transactions on Knowledge and Data Engineering | VOL. 31
Jie Lu, et. al.Jie Lu ... Feng Gu
01 Jan 2018
IEEE Transactions on Knowledge and Data Engineering | VOL. 31

Accumulating regional density dissimilarity for concept drift detection in data streams
Anjin Liu ... Guangquan Zhang
Pattern Recognition | VOL. 76
Anjin Liu, et. al.Anjin Liu ... Guangquan Zhang
07 Nov 2017
Pattern Recognition | VOL. 76

Integrated detection and localization of concept drifts in process mining with batch and stream trace clustering support
Rafael Gaspar De Sousa ... Hajo Alexander Reijers
Data & Knowledge Engineering | VOL. 149
Rafael Gaspar De Sousa, et. al.Rafael Gaspar De Sousa ... Hajo Alexander Reijers
02 Dec 2023
Data & Knowledge Engineering | VOL. 149

Chaotic Ant Swarm based Feature Subset Selection with Concept Drift Detection and Classification Model for Data Streaming Applications
S Caxton Emerald ... T Vengattaraman
-
S Caxton Emerald, et. al.S Caxton Emerald ... T Vengattaraman
23 Feb 2022
23 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting virtual concept drift of regressors without ground truth values

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery