Abstract

The term Big Data (or BigData) is widely used in scientific, educational, and business literature; however, there does not exist a single definition that can be unreservedly called “canonical”. A careless use of Big Data term to promote commercial software further emphasizes the importance of this issue. In this paper, we have performed a review of definitions of Big Data and highlighted the principal features that are attributed to Big Data. We compared all these principal features with features of databases compiled using Edgar F. Codd’s publications, and showed that they are not unique and can also be attributed to the databases. Having studied C. Lynch original work, we proposed the definition of Big Data based on the so-called conservation institution. The key point of this definition is a shift from purely technical attitude towards public institutions. Since the current use of the Big Data term may lead to a loss of meaning. There is a need not only to spread out best practices but also to eliminate or minimize the use of dubious or misleading ones.

Highlights

  • Specific study of a given phenomenon requires determination of a common terms dictionary that ensures consistent communications and understanding of the object being investigated

  • The Big Data term is widely used in relation to scientific, educational and business tasks but there is no single specific definition that can be unreservedly called as “canonical” Big Data definition

  • In this paper we present a review of typical Big Data definitions

Read more

Summary

Introduction

Specific study of a given phenomenon requires determination of a common terms dictionary that ensures consistent communications and understanding of the object being investigated. The Big Data term is widely used in relation to scientific, educational and business tasks but there is no single specific definition that can be unreservedly called as “canonical” Big Data definition. The use of Big Data term to promote commercial software intelligence solutions further exaggerates the situation. Clifford Lynch is considered the person who firstly introduced the term Big Data [1]. His paper does not provide explicit definition of the Big Data. It discusses the challenges that appear due to a significant increase of the data volumes and considers new solutions that allow to obtain, transform, store, and analyze those huge datasets. Lynch formulated a foundation of what he called “preservation institutions”

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call