Using Simpson’s Paradox to Discover Interesting Patterns in Behavioral Data

Nazanin Alipourfard, Kristina Lerman, Peter Fennell

doi:10.1609/icwsm.v12i1.15017

Abstract

We describe a data-driven discovery method that leverages Simpson's paradox to uncover interesting patterns in behavioral data. Our method systematically disaggregates data to identify subgroups within a population whose behavior deviates significantly from the rest of the population. Given an outcome of interest and a set of covariates, the method follows three steps. First, it disaggregates data into subgroups, by conditioning on a particular covariate, so as to minimize the variation of the outcome within the subgroups. Next, it models the outcome as a linear function of another covariate, both in the subgroups and in the aggregate data. Finally, it compares trends to identify disaggregations that produce subgroups with different behaviors from the aggregate. We illustrate the method by applying it to three real-world behavioral datasets, including the Q&A site Stack Exchange and the online learning platforms Khan Academy and Duolingo.
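The three steps in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `linear_slope` and `compare_trends` are hypothetical, equal-frequency (quantile) binning stands in for the variance-minimizing disaggregation the abstract describes, and the data are synthetic, constructed only so that the aggregate trend and the subgroup trends point in opposite directions.

```python
import numpy as np


def linear_slope(x, y):
    """Least-squares slope of y modeled as a linear function of x."""
    slope, _intercept = np.polyfit(x, y, 1)
    return slope


def compare_trends(x, z, y, n_bins=3):
    """Compare the aggregate trend of y on x with trends inside subgroups of z.

    Assumption: subgroups come from quantile bins of the conditioning
    covariate z, a simple stand-in for the paper's disaggregation step.
    """
    # Step 1: disaggregate by conditioning on z (simplified binning).
    edges = np.quantile(z, np.linspace(0, 1, n_bins + 1))
    groups = np.digitize(z, edges[1:-1])

    # Step 2: fit a linear trend in the aggregate and in each subgroup.
    aggregate = linear_slope(x, y)
    subgroup = [linear_slope(x[groups == g], y[groups == g])
                for g in range(n_bins)]

    # Step 3: flag the disaggregation if every subgroup trend has the
    # opposite sign to the aggregate trend.
    is_reversed = all(np.sign(s) != np.sign(aggregate) for s in subgroup)
    return aggregate, subgroup, is_reversed


# Synthetic data (illustrative only): three well-separated clusters in z.
rng = np.random.default_rng(0)
g = np.repeat([0, 1, 2], 200)
z = g + rng.normal(0, 0.05, g.size)          # conditioning covariate
x = 2 * g + rng.uniform(0, 1, g.size)        # covariate of interest
y = 3 * g - x + rng.normal(0, 0.1, g.size)   # outcome

aggregate, subgroup, is_reversed = compare_trends(x, z, y)
print("aggregate slope:", aggregate)
print("subgroup slopes:", subgroup)
print("trend reversed in all subgroups:", is_reversed)
```

In this construction the outcome falls with `x` inside every cluster but rises with `x` in the pooled data, because clusters with larger `x` also have a higher baseline outcome. A sign comparison is the simplest possible test; a fuller treatment would also assess whether the fitted trends are statistically significant.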

Journal: Proceedings of the International AAAI Conference on Web and Social Media
Publication Date: Jun 15, 2018
Citations: 8

