Abstract

Recently, the popularity of big data as a research field has shown continuous and wide-scale growth. This study aims to capture the scientific structure and topic evolution of big data research using bibliometrics and text mining-based analysis methods. Bibliographic data of journal articles regarding big data published between 2009 to 2018 were collected from the Scopus database and analyzed. The results show a significant growth of publications since 2014. Furthermore, the findings of this study highlight the core journals, most cited articles, top productive authors, countries, and institutions. Secondly, a unique approach to identifying and analyzing major research themes in big data publications was proposed. Keywords were clustered, and each cluster was labeled as a theme. Moreover, the papers were divided into four sub-periods to observe the thematic evolution. The theme mapping reveals that research on big data is dominated by big data analytics, which covers methods, tools, supporting infrastructure, and applications. Other critical aspects of big data research are security and privacy. Social networks and the Internet of things are significant sources of big data, and the resources and services offered by cloud computing strongly support the management and processing of big data.

Highlights

  • Roger Mougalas from O’Reilly Media coined the term “big data” for the first time in 2005 [1].It refers to a massive set of data that is hard to process and manage using traditional methods and tools

  • Social networks and the Internet of things are significant sources of big data, and the resources and services offered by cloud computing strongly support the management and processing of big data

  • This study presents an overview of big data research landscapes based on the analysis of one decade (2009–2018) of journal publications from the Scopus database

Read more

Summary

Introduction

Roger Mougalas from O’Reilly Media coined the term “big data” for the first time in 2005 [1]. The IDC as a pioneer in studying big data, defines big data as follows: “Big data technologies describe a new generation of technologies and architectures, designed to economically extract value from huge volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis” [8]. A study to identify the relations between companies and big data technologies using text mining and topic modeling method was conducted by analyzing not the research publications but the news aggregated through Google. The resulting information and knowledge may hold interest for students, academics, We present the key-terms clusters analysis and capture the major themes in big data research; practitioners, science policy-makers, and R & D management in this field.

Data and Methods
Data Collection
Data Pre-Processing
General Analysis and Thematic Mapping
Bibliometrics Analysis of Publication Trends
Thematic Analysis of the Author and Index Keywords
Major Themes in Big Data Research
Big Data Analytics Tools and Algorithms
Privacy and Security Issue in Big Data
Big Data Applications and Services
Big Data-Related Technologies
Objectives
Future Big Data Research Trends
Threats to Validity
Findings
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.