Abstract
Online incremental models for recommendation are now pervasive in both industry and academia. However, there is no standard evaluation methodology for the algorithms that maintain such models. Moreover, the online evaluation methodologies available in the literature generally fall short on the statistical validation of results, since such validation is not trivially applicable to stream-based algorithms. We propose a k-fold validation framework for the pairwise comparison of recommendation algorithms that learn from user feedback streams, using prequential evaluation. Our proposal enables continuous statistical testing on adaptive-size sliding windows over the outcome of the prequential process, allowing practitioners and researchers to make decisions in real time based on solid statistical evidence. We present a set of experiments to gain insight into the sensitivity and robustness of two statistical tests, McNemar's and the Wilcoxon signed-rank test, in a streaming-data environment. Our results show that, besides allowing real-time, fine-grained online assessment, the online versions of the statistical tests are at least as robust as the batch versions, and clearly more robust than a simple prequential single-fold approach.
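To make the proposed setup concrete, the following minimal Python sketch pairs prequential (test-then-learn) evaluation with a McNemar test computed over a sliding window of paired hit/miss outcomes. The `recommend`/`update` interface, the recommendation list size, and the window length are illustrative assumptions rather than the paper's implementation; the actual framework additionally distributes the stream across k folds and uses adaptive-size windows.

```python
from collections import deque

# Hypothetical sketch: pairwise prequential comparison of two stream-based
# recommenders with a sliding-window McNemar test. The recommender API
# (recommend/update) is assumed for illustration only.

CHI2_95 = 3.841  # critical value: chi-squared, 1 d.o.f., alpha = 0.05

def sliding_mcnemar(hits_a, hits_b):
    """McNemar statistic (with continuity correction) over the paired
    binary outcomes currently held in the window."""
    n01 = sum(1 for a, b in zip(hits_a, hits_b) if not a and b)
    n10 = sum(1 for a, b in zip(hits_a, hits_b) if a and not b)
    if n01 + n10 == 0:
        return 0.0  # no discordant pairs: no evidence of a difference
    return (abs(n01 - n10) - 1) ** 2 / (n01 + n10)

def prequential_compare(stream, alg_a, alg_b, window=1000, n=10):
    """For each incoming (user, item) event: first test both algorithms
    (was the hidden item in the top-n list?), then let them learn from
    the event; yield the current window statistic and its significance."""
    hits_a = deque(maxlen=window)
    hits_b = deque(maxlen=window)
    for user, item in stream:
        hits_a.append(item in alg_a.recommend(user, n))  # test...
        hits_b.append(item in alg_b.recommend(user, n))
        alg_a.update(user, item)                         # ...then learn
        alg_b.update(user, item)
        stat = sliding_mcnemar(hits_a, hits_b)
        yield stat, stat > CHI2_95
```

Because the statistic is re-evaluated at every event, a practitioner can stop the comparison, or switch the deployed model, as soon as the windowed test signals a significant difference, rather than waiting for an offline batch evaluation to complete.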