Abstract
The term “big data” will always be remembered as the big buzzword of 2013. According to the Wikipedia, big data “is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools”. In other perceptions, the “3 Vs” that characterize it (i.e., volume, velocity, and variety) or the “4 Vs” (adding veracity to the previous three) are responsible for the fact that it exceeds an organization’s own data as well as its storage or compute capacity for accurate and timely decision-making. In essence, big data refers to the situation that more and more aspects and artifacts of everyday life, be it personal or professional, are available in digital form, e.g., personal or company profiles, social network and blog postings, buying histories, health records, to name just a few, that increasingly more data gets dynamically produced especially on the Internet and on the Web, and that nowadays the tools and techniques are available for evaluating and analyzing all that data in various combinations. Numerous companies already foresee the enormous business effects that analytical scenarios based on big data can have, and the impacts that it will hence have on advertising, commerce, and business intelligence (BI). This paper reviews the issues, techniques, and applications of big data, with an emphasis on future BI architectures.
Highlights
Ever since the beginning of the digital age, data in digital form has received a growing importance, first primarily in the business domain and later in the private domain
What does it take to do so, and what needs to change if the company has previously set up a data warehouse for its data analytics purposes? In particular, we briefly look at strategy development and present a modification of the “classical” data warehouse architecture that is intended to accommodate big data requirements
In this paper we have tried to survey various dimensions that are relevant to the field of big data that has emerged in recent years
Summary
Ever since the beginning of the digital age, data in digital form has received a growing importance, first primarily in the business domain and later in the private domain. As we have written in my book on Web 2.0 [26], this transition was determined by three parallel streams of development: the applications stream that has brought along a number of services anybody can nowadays use on the Internet and the Web; the technology stream which has provided the underlying infrastructure groundwork for all of this with fast moving and comprehensive advances in networking and hardware technology and quite a bit of progress regarding software; and the user participation and contribution stream (which we might call the socialization stream) which has changed the way in which users, both private and professional ones, perceive the Web, interact with it, contribute to it, and in particular publish their own or their private information on it These three streams have brought along a number of techniques, technologies, and usage patterns that at present converge, and the result is what has received the term “Web 2.0”. Due to a lack of expertise of the author, the legal dimension will not be dealt with in this paper
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.