Abstract

The last two decades were marked by an exponential growth in the volume of data originating from various data sources, from mobile phones to social media contents, all through the multitude devices of the Internet of Things. This flow of data can’t be managed using a classical approach and has led to the emergence of a new buzz word: Big Data. Among the research challenges related to Big Data there is the issue of data storage. Traditional relational database systems proved to be unable to efficiently manage Big Data datasets. In this context, Cloud Computing plays a relevant role, as it offers interesting models to deal with Big Data storage, especially the model known as Database as a Service (DBaaS). We propose, in this article, a review of database solutions that are offered as DBaaS and discuss their adaptability to Big Data applications.

Highlights

  • The volume of data stored in the world has been doubling every two years, and will reach a dazzling 40 billion terabytes (TB) by the year 2020 [1]

  • Gantz et al define Big Data in [33] as ―a new generation of technologies and architectures, designed to economically extract value from very large volumes of a wide variety of data, by enabling high-velocity capture, discovery, and/or analysis‖. This definition highlights a fifth V related to Big Data, namely Value, as it is not enough to store a large amount of data, but it is important to analyze it in order to extract value from it

  • MongoLab is a fully-managed, highly-performant, highlyavailable MongoDB database offered as DataBase as a Service (DBaaS) that runs in major cloud infrastructures: Amazon WS, Google Cloud Platform, Rackspace, and Windows Azure, etc

Read more

Summary

INTRODUCTION

The volume of data stored in the world has been doubling every two years, and will reach a dazzling 40 billion terabytes (TB) by the year 2020 [1]. It is estimated that more than 80% of Internet users use Cloud Computing in one form or another, from email services to different business applications as a service, all through data storage, development platforms, etc [2]. Through the plethora of definitions, it emerges that cloud computing has several major characteristics, especially the following: Virtualization: physical resources are virtualized in order to optimize their utilization; Pooling: multiple users share access to the same pool of virtualized resources This results in optimizing costs of infrastructure, installation, hosting, and maintenance for providers, who benefit from the economy of scale, and can offer more competitive prices; Ubiquity: cloud services are always accessible, anytime, anywhere, and from various computing devices; Remote access: cloud services are accessible via a network.

BIG DATA
BIG DATA STORAGE
NoSQL database systems
NewSQL database systems
Limitations
File Storage Systems
Cloud SQL
Cloudant
MongoLab
DynamoDB
Synchronous Asynchronous
DISCUSSION
Provider’s reputation
Deployment
X: NOT AVAILABLE
Database model
Law and regulations
Payment mode
Data volume
Data consistency
Findings
Scalability
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call