Abstract

The volume of data generated and stored by social media has increased over the last decade. Analyzing and understanding this kind of data can therefore offer relevant information in different contexts and assist researchers and companies in the decision-making process. However, the data are large in volume, come from different sources in different formats, and are created rapidly. These facts make knowledge extraction difficult, turning it into a complex and costly process. The scientific contribution of this paper is the development of a social media data integration model based on a data warehouse, aimed at reducing the computational costs of data analysis and supporting the application of techniques to discover useful knowledge. Unlike related work in the literature, we address both Facebook and Twitter. We also contribute a model for data acquisition, transformation, and loading, which enables the extraction of useful knowledge in contexts where the human capacity for understanding is exceeded. The results showed that the proposed data warehouse improves the quality of data mining algorithms compared to related works, while also reducing the execution time.

Highlights

  • In the last few years, the amount of data produced on the internet has increased with the advent of Web 2.0 technology, especially data from social media environments (Ghani et al, 2018)

  • This paper presents the definitions used for building the schema and the opinion analysis, whose polarity can be positive, negative, or neutral (Balazs and Velásquez, 2016)

  • Big data is defined as a large data set with no pattern that exceeds the human capacity for understanding



Introduction

In the last few years, the amount of data produced on the internet has increased with the advent of Web 2.0 technology, especially data from social media environments (Ghani et al, 2018). This has had a significant impact on contemporary society due to the ease of sharing information and communicating among people. Big data is defined as a large data set with no pattern that exceeds the human capacity for understanding. This concept is widely referenced in computer science nowadays, especially due to its potential in the decision-making process and in discovering trends and associations (Sivarajah et al, 2017). A brief explanation of its principal characteristics is: Volume, given by the magnitude and size of the data, which can reach terabytes, petabytes, or exabytes.
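The acquisition, transformation, and loading flow the abstract describes can be sketched in miniature. The snippet below is an illustrative toy, not the paper's actual model: the star schema (a `fact_post` table with a `dim_source` dimension), the table names, and the hard-coded records standing in for the Facebook and Twitter APIs are all assumptions made for the example.

```python
# Toy ETL sketch: load social media posts into a minimal star-schema
# warehouse. All names here are illustrative, not the paper's schema.
import sqlite3

# Acquisition: in a real pipeline these records would come from the
# Facebook and Twitter APIs; here they are hard-coded stand-ins.
raw_posts = [
    {"source": "Twitter", "text": "Great product!", "created": "2023-05-01"},
    {"source": "Facebook", "text": "Not impressed.", "created": "2023-05-02"},
]

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE dim_source (id INTEGER PRIMARY KEY, name TEXT UNIQUE)")
cur.execute("CREATE TABLE fact_post (source_id INTEGER, text TEXT, created TEXT)")

for post in raw_posts:
    # Transformation: normalize the source name and deduplicate it into
    # the dimension table before loading the fact row.
    name = post["source"].lower()
    cur.execute("INSERT OR IGNORE INTO dim_source (name) VALUES (?)", (name,))
    source_id = cur.execute(
        "SELECT id FROM dim_source WHERE name = ?", (name,)
    ).fetchone()[0]
    cur.execute(
        "INSERT INTO fact_post VALUES (?, ?, ?)",
        (source_id, post["text"], post["created"]),
    )
conn.commit()

# A simple analytical query over the warehouse: post count per source.
rows = cur.execute(
    "SELECT s.name, COUNT(*) FROM fact_post f "
    "JOIN dim_source s ON s.id = f.source_id GROUP BY s.name"
).fetchall()
print(rows)
```

Separating the source dimension from the fact table is what lets analytical queries group and filter posts cheaply, which is the kind of cost reduction the paper attributes to its warehouse-based model.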
