Big Data Management: What to Keep from the Past to Face Future Challenges?

G Vargas-Solar, J L Zechinelli-Martini, J A Espinosa-Oviedo

doi:10.1007/s41019-017-0043-3

Abstract

The emergence of new hardware architectures and the continuous production of data open new challenges for data management. It is no longer pertinent to reason with respect to a predefined set of resources (i.e., computing, storage and main memory). Instead, data processing algorithms and processes must be designed assuming unlimited resources obtained via the “pay-as-you-go” model. Under this model, resource provisioning must weigh the economic cost of the processes against the use and parallel exploitation of available computing resources. Consequently, new methodologies, algorithms and tools for querying, deploying and programming data management functions have to be provided in scalable and elastic architectures that can cope with the characteristics of Big Data aware systems (intelligent systems, decision making, virtual environments, smart cities, drug personalization). These functions must respect QoS properties (e.g., security, reliability, fault tolerance, dynamic evolution and adaptability) and behavior properties (e.g., transactional execution) according to application requirements. Mature and novel system architectures propose models and mechanisms for adding these properties to new efficient data management and processing functions delivered as services. This paper gives an overview of the different architectures in which efficient data management functions can be delivered to address Big Data processing challenges.

Highlights

Database management systems (DBMS) emerged as a flexible and cost-effective solution to information organization, maintenance and access problems found in organizations.
Together with other approaches in the domain and in industry, we propose an approach and tool named ExSchema that enables the automatic discovery of schemata from polyglot persistence applications.
Data management must be revisited to design strategies that couple the characteristics of novel architectures with users’ preferences. In this context we identify three key scientific challenges: (i) data access and processing guided by Service Level Agreement (SLA) contracts, where data are produced by services and devices connected on heterogeneous networks; (ii) estimation and reduction of the temporal, economic and energy consumption cost of accessing and processing data; (iii) optimization of data processing guided by SLA contracts expressed using cost models as reference.
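To make challenges (ii) and (iii) more concrete, the following is a minimal sketch of SLA-guided plan selection under a three-dimensional cost model (time, money, energy). It is not the authors' method: the `Plan` and `SLA` types, the `choose_plan` function, the weighting scheme and all numeric values are hypothetical and serve only to illustrate the idea of filtering execution alternatives by SLA bounds and then ranking them by estimated cost.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Plan:
    """A candidate way of executing a data processing task (hypothetical)."""
    name: str
    seconds: float  # estimated execution time
    dollars: float  # estimated monetary cost under pay-as-you-go
    joules: float   # estimated energy consumption


@dataclass(frozen=True)
class SLA:
    """Upper bounds agreed with the user (hypothetical contract shape)."""
    max_seconds: float
    max_dollars: float
    max_joules: float


def feasible(plan: Plan, sla: SLA) -> bool:
    """A plan is feasible if it respects every bound of the SLA."""
    return (plan.seconds <= sla.max_seconds
            and plan.dollars <= sla.max_dollars
            and plan.joules <= sla.max_joules)


def choose_plan(plans: Sequence[Plan], sla: SLA,
                weights: Tuple[float, float, float] = (1.0, 1.0, 1.0)
                ) -> Optional[Plan]:
    """Return the feasible plan with the lowest weighted, SLA-normalized cost.

    Each cost is divided by its SLA bound so that the three dimensions are
    comparable; the weights express the user's preferences (an assumption).
    Returns None when no plan satisfies the SLA.
    """
    w_time, w_money, w_energy = weights
    candidates = [p for p in plans if feasible(p, sla)]
    if not candidates:
        return None

    def score(p: Plan) -> float:
        return (w_time * p.seconds / sla.max_seconds
                + w_money * p.dollars / sla.max_dollars
                + w_energy * p.joules / sla.max_joules)

    return min(candidates, key=score)


# Illustrative estimates only; real values would come from a cost model.
PLANS = [
    Plan("many-nodes", seconds=10, dollars=5.0, joules=100),
    Plan("few-nodes", seconds=60, dollars=1.0, joules=50),
    Plan("single-node", seconds=200, dollars=0.5, joules=30),
]
```

With a tight budget such as `SLA(120, 4, 200)`, only `few-nodes` is feasible; raising the time weight under a looser SLA shifts the choice toward the faster but more expensive alternative. Normalizing by the SLA bounds is one simple design choice among many for combining heterogeneous costs.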

Summary

Introduction

Database management systems (DBMS) emerged as a flexible and cost-effective solution to information organization, maintenance and access problems found in organizations (e.g., business, academia and government). The evolution of data models and the consolidation of distributed systems made it possible to develop mediation infrastructures [109] that enable transparent access to multiple data sources through querying, navigation and management facilities. Examples of such systems are multi-databases, data warehouses, Web portals deployed on Internet/Intranets, and polyglot persistence solutions [78]. The DBMS of the future must enable the execution of algorithms and of complex processes (e.g., scientific experiments) that use huge data collections (e.g., multimedia documents, complex graphs with thousands of nodes). This calls for a thorough revision of the hypotheses underlying the algorithms, protocols and architectures developed for classic data management approaches [31].

From Monolithic to Customizable DBMS Architectures

Classic Functional Architecture

OODBMS

Component-Oriented DBMS

Database Middleware

Configuring and Unbundling Data Management

Summarizing Componentization of DBMS

Service-Oriented DBMS

Intensive Big Data Management

NoSQL Data Store Managers

Dealing with Multiple Storage Spaces

Data Analytics

Parallel Model for Implementing Data Processing Functions

Big Data Analytics Systems

Big Data Analytics Stacks

Distributed Data Persistence Solutions

Cloud Data Management Services

Parallel Runtime Environments

Discussion

Findings

Perspectives


Journal: Data Science and Engineering	Publication Date: Aug 10, 2017
Citations: 21	License type: open-access
