Net activism and whistleblowing on YouTube: a text mining analysis.

Nicolas Turenne

doi:10.1007/s11042-022-13777-0

Nicolas Turenne

Open Access

https://doi.org/10.1007/s11042-022-13777-0

Copy DOI

Abstract

Social media is more and more dominant in everyday life for people around the world. YouTube content is a resource that may be useful, in social computational science, for understanding key questions about society. Using this resource, we performed web scraping to create a dataset of 644,575 video transcriptions concerning net activism and whistleblowing. We automatically performed linguistic feature extraction to capture a representation of each video using its title, description and transcription (downloaded metadata). The next step was to clean the dataset using automatic clustering with linguistic representation to identify unmatched videos and noisy keywords. Using these keywords to exclude videos, we finally obtained a dataset that was reduced by 95%, i.e., it contained 35,730 video transcriptions. Then, we again automatically clustered the videos using a lexical representation and split the dataset into subsets, leading to hundreds of clusters that we interpreted manually to identify a hierarchy of topics of interest concerning whistleblowing. We used the dataset to learn a lexical representation for a specific topic and to detect unknown whistleblowing videos for this topic; the accuracy of this detection is 57.4%. We also used the dataset to identify interesting context linguistic markers around the names of whistleblowers. From a given list of names, we automatically extracted all 5-g word sequences from the dataset and identified interesting markers in the left and right contexts for each name by manual interpretation. The results of our study are the following: a dataset (raw and cleaned collections) concerning whistleblowing, a hierarchy of topics about whistleblowing, the automatic prediction of whistleblowing and the semi-automatic semantic analysis of markers around whistleblower names. This text mining analysis can be exploited for digital sociology and e-democracy studies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Multimedia tools and applications	Publication Date: Sep 29, 2022
Citations: 1	License type: NO-CC CODE

R Discovery Prime

R Discovery Prime

Net activism and whistleblowing on YouTube: a text mining analysis.

Abstract

Talk to us

Similar Papers

More From: Multimedia tools and applications

Lead the way for us

Similar Papers

Assessing Patients Perception: Analyzing the Quality, Reliability, Comprehensibility, and the Mentioned Medical Concepts of Traumatic Brain Injury Videos on YouTube
Mustafa Hüseyin Temel ... Fatih Bağcıer
World Neurosurgery | VOL. 185
Mustafa Hüseyin Temel, et. al.Mustafa Hüseyin Temel ... Fatih Bağcıer
06 Mar 2024
World Neurosurgery | VOL. 185

Infodemia and COVID-19: a text mining analysis
W De Caro
European Journal of Public Health | VOL. 30
W De CaroW De Caro
01 Sep 2020
European Journal of Public Health | VOL. 30

A Text Mining Analysis on Big Data Extracted from Social Media
Gabriella Schoier ... Giuseppe Borruso
-
Gabriella Schoier, et. al.Gabriella Schoier ... Giuseppe Borruso
01 Jan 2020
01 Jan 2020

Social media analysis for product safety using text mining and sentiment analysis
Haruna Isah ... Paul Trundle
-
Haruna Isah, et. al.Haruna Isah ... Paul Trundle
01 Sep 2014
01 Sep 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Net activism and whistleblowing on YouTube: a text mining analysis.

Abstract

Talk to us

Similar Papers

More From: Multimedia tools and applications