OREO

Yin Lin,Yuval Moskovitch,Tova Milo,Brit Youngmann,H V Jagadish

doi:10.14778/3554821.3554846

Abstract

Data analytics often make sense of large data sets by generalization: aggregating from the detailed data to a more general context. Given a dataset, misleading generalizations can sometimes be drawn from a cherry-picked level of aggregation to obscure substantial subgroups that oppose the generalization. Our goal is to detect and explain cherry-picked generalizations by refining the corresponding aggregate queries. We demonstrate OREO, a system to compute a support score of the given statement to quantify the quality of the generalization; that is, whether the aggregated result is an accurate reflection of the data. To better understand the resulting score, our system also identifies significant counterexamples and alternative statements that better represent the data at hand. We will demonstrate the utility of OREO for investigating generalizations, by interacting with the VLDB'22 participants who will use the OREO interface for statement validation and explanation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

OREO

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment

Lead the way for us

Similar Papers

Error-bounded sampling for analytics on big sparse data
Ying Yan ... Liang Jeff Chen
Proceedings of the VLDB Endowment | VOL. 7
Ying Yan, et. al.Ying Yan ... Liang Jeff Chen
01 Aug 2014
Proceedings of the VLDB Endowment | VOL. 7

Learning to accurately COUNT with query-driven predictive analytics
Christos Anagnostopoulos ... Peter Triantafillou
-
Christos Anagnostopoulos, et. al.Christos Anagnostopoulos ... Peter Triantafillou
01 Oct 2015
01 Oct 2015

Modern Privacy Risks and Protection Strategies in Data Analytics
Narsingrao Vasupula ... Vazralu Munnangi
-
Narsingrao Vasupula, et. al.Narsingrao Vasupula ... Vazralu Munnangi
24 Jul 2021
24 Jul 2021

DATA SCIENCE: Data Visualization and Data Analytics in the Process of Data Mining
Ijsrem Journal
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 08
Ijsrem JournalIjsrem Journal
25 Jan 2024
INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 08

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

OREO

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment