Abstract

The reliance on small samples and underpowered studies may undermine the replicability of scientific findings. Large sample sizes may be necessary to achieve adequate statistical power. Crowdsourcing sites such as Amazon's Mechanical Turk (MTurk) have been regarded as an economical means of achieving larger samples. Because MTurk participants may engage in behaviors that adversely affect data quality, much recent research has focused on assessing the quality of data obtained from MTurk samples. However, participants from traditional campus- and community-based samples may also engage in behaviors that adversely affect the quality of the data they provide. We compared an MTurk, a campus, and a community sample to measure how frequently participants report engaging in problematic respondent behaviors. We report evidence suggesting that participants from all three samples engage in problematic respondent behaviors at comparable rates. Because statistical power is influenced by factors beyond sample size, including data integrity, methodological controls must be refined to better identify and reduce participant engagement in problematic respondent behaviors.
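
To make the sample-size point concrete, the sketch below shows an a priori power analysis for an independent-samples t-test using Python's statsmodels. The target effect size (d = 0.30, roughly the smallest effect reported in the highlights below), alpha, and power values are illustrative assumptions, not values taken from the paper.

```python
# Minimal a priori power analysis for a two-group design.
# d = 0.30, alpha = .05, and power = .80 are illustrative assumptions.
from statsmodels.stats.power import TTestIndPower

n_per_group = TTestIndPower().solve_power(
    effect_size=0.30,          # Cohen's d to detect
    alpha=0.05,                # two-tailed significance level
    power=0.80,                # desired statistical power
    alternative='two-sided',
)
print(round(n_per_group))      # ~176 participants per group
```

Detecting a small-to-medium effect at conventional power already demands several hundred participants in total, which is the practical pressure that pushes researchers toward crowdsourced samples.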

Highlights

  • Concerns have been raised in recent years about the replicability of published scientific studies and the accuracy of reported effect sizes, which are often distorted as a function of underpowered research designs [1,2,3,4]

  • We examined whether Mechanical Turk (MTurk) participants engaged in potentially problematic respondent behaviors more frequently than participants from more traditional laboratory-based samples, and whether behavior among participants from more traditional samples was uniform across different laboratory-based sample types

  • The first orthogonal contrast revealed that MTurk participants were more likely than campus and community participants to complete a study while multitasking (t(512) = -5.90, p = 6.76 × 10⁻⁹, d = .52), to leave the page of a study and return at a later point in time (t(512) = -4.72, p = 3.01 × 10⁻⁶, d = .42), to look for studies by researchers they already know (t(512) = -9.57, p = 4.53 × 10⁻²⁰, d = .85), and to contact a researcher if they find a glitch in their survey (t(512) = -3.35, p = .001, d = .30)
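
The contrast above pits MTurk against the average of the campus and community groups. As a hedged illustration (our construction, not the authors' analysis code), a planned contrast of this kind can be computed from the group means and a pooled within-group error term; the group arrays and the (2, -1, -1) weights below are hypothetical.

```python
import numpy as np
from scipy import stats

def contrast_t(groups, weights):
    """Planned contrast across k independent groups with a pooled error term.

    groups  -- list of 1-D NumPy arrays of scores, one per sample
    weights -- contrast weights summing to zero, e.g. (2, -1, -1)
               to compare MTurk against the mean of campus and community
    Returns (t, df, p) for a two-tailed test.
    """
    k = len(groups)
    ns = np.array([len(g) for g in groups])
    means = np.array([g.mean() for g in groups])
    # MSE: pooled within-group variance, as in a one-way ANOVA
    ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)
    df = int(ns.sum() - k)
    mse = ss_within / df
    w = np.asarray(weights, dtype=float)
    psi = (w * means).sum()                  # contrast estimate
    se = np.sqrt(mse * (w ** 2 / ns).sum())  # standard error of the contrast
    t = psi / se
    p = 2 * stats.t.sf(abs(t), df)           # two-tailed p-value
    return t, df, p

# Hypothetical usage (mturk, campus, community are arrays of scores):
# t, df, p = contrast_t([mturk, campus, community], weights=(2, -1, -1))
```

With three groups the error degrees of freedom are N - 3, consistent with the df of 512 reported above.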


Introduction

Concerns have been raised in recent years about the replicability of published scientific studies and the accuracy of reported effect sizes, which are often distorted as a function of underpowered research designs [1,2,3,4]. Crowdsourcing platforms such as MTurk offer one economical route to the larger samples that adequate statistical power can require. Data collected on MTurk have been shown to be generally comparable to data collected in the laboratory and in the community for many psychological tasks, including cognitive, social, and judgment and decision-making tasks [10,11,12,13]. This has generally been taken as evidence that data from MTurk are of high quality, reflecting an assumption that laboratory-based data collection is a gold standard in scientific research.

