CREDBANK: A Large-Scale Social Media Corpus With Associated Credibility Annotations

Tanushree Mitra, Eric Gilbert

doi:10.1609/icwsm.v9i1.14625

Abstract

Social media has quickly risen to prominence as a news source, yet lingering doubts remain about its ability to spread rumor and misinformation. Systematically studying this phenomenon, however, has been difficult due to the need to collect large-scale, unbiased data along with in-situ judgements of its accuracy. In this paper we present CREDBANK, a corpus designed to bridge this gap by systematically combining machine and human computation. Specifically, CREDBANK is a corpus of tweets, topics, events and associated human credibility judgements. It is based on the real-time tracking of more than 1 billion streaming tweets over a period of more than three months, computational summarizations of those tweets, and intelligent routings of the tweet streams to human annotators — within a few hours of those events unfolding on Twitter. In total CREDBANK comprises more than 60 million tweets grouped into 1049 real-world events, each annotated by 30 human annotators. As an example, with CREDBANK one can quickly calculate that roughly 24% of the events in the global tweet stream are not perceived as credible. We have made CREDBANK publicly available, and hope it will enable new research questions related to online information credibility in fields such as social science, data mining and health.
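The kind of quick calculation mentioned in the abstract (the share of events not perceived as credible) might look roughly like the sketch below. Everything about the data layout here is an assumption rather than something stated in the abstract: the `ratings` mapping, the -2 to +2 rating scale, the 70% agreement threshold, and the function name `share_not_credible` are all hypothetical, chosen only to illustrate the idea of aggregating per-event annotator judgements.

```python
from typing import Dict, List

# Hypothetical layout: event id -> list of annotator ratings on an assumed
# 5-point scale from -2 (certainly inaccurate) to +2 (certainly accurate).
Ratings = Dict[str, List[int]]


def share_not_credible(ratings: Ratings, top: int = 2,
                       min_agreement: float = 0.7) -> float:
    """Fraction of events where fewer than `min_agreement` of the
    annotators chose the top rating (an assumed credibility rule)."""
    if not ratings:
        raise ValueError("no events to score")
    not_credible = 0
    for event_id, scores in ratings.items():
        if not scores:
            raise ValueError(f"event {event_id!r} has no ratings")
        # Share of annotators who picked the top rating for this event.
        agreement = sum(1 for s in scores if s == top) / len(scores)
        if agreement < min_agreement:
            not_credible += 1
    return not_credible / len(ratings)


# Toy data, invented for illustration; not taken from CREDBANK.
toy = {
    "event_a": [2] * 27 + [1] * 3,   # 90% chose the top rating
    "event_b": [2] * 15 + [0] * 15,  # 50% chose the top rating
    "event_c": [2] * 30,             # unanimous
    "event_d": [1] * 20 + [2] * 10,  # about 33% chose the top rating
}
print(share_not_credible(toy))  # 0.5
```

With a different threshold or rating scale the resulting share changes, so any figure computed this way depends on the credibility rule chosen.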

Journal: Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media
Publication Date: Aug 3, 2021
Citations: 91

Similar Papers

Mining One Percent of Twitter: Collections, Baselines, Sampling
Carolin Gerlitz ... Bernhard Rieder
M/C Journal | VOL. 16
02 Mar 2013

Assessing the level of knowledge, attitudes, and beliefs about Ebola virus disease among college students
Thrissia Koralek ... Miryha Gould Runnerstrom
American journal of infection control | VOL. 43
29 Jul 2015

Hybrid Intelligent Techniques in Text Mining and Analysis of Social Networks and Media Data
Neha Golani ... Ishan Khandelwal
01 Jan 2017

Fair Game? User Evaluations of Social Media Data Mining
Helen Kennedy
01 Jan 2015
