Abstract

The adoption of Web 2.0 technologies, the Internet of Things, etc. by individuals and organizations has led to an explosion of data. As it stands, existing Relational Database Management Systems (RDBMSs) are incapable of handling this deluge of data. The term Big Data was coined to represent these vast, fast and complex datasets that regular RDBMSs could not handle. Special tools or frameworks were developed to deal with processing, managing and storing this big data. These tools are capable of functioning in distributed, industry-standard environments, thereby maintaining efficiency and effectiveness at a business level. Apache Hadoop is an example of such a framework. This report discusses big data, its origins, the opportunities and challenges that it presents, big data analytics, and the application of big data using existing big data tools or frameworks. It also discusses Apache Hadoop as a big data framework and provides a basic overview of this technology from technological and business perspectives.

Highlights

  • For the first time in the history of modern technology, it is not the computers that are changing but the information they process

  • They round up by saying that big data comes from a myriad of sources, including “sensors, devices, video/audio, networks, log files, transactional applications, web, and social media much of it generated in real time and in a very large scale” (IBM, 2017)

  • Several use cases documented by Moise & Pournaras (2017) indicate that the use of frameworks like Hadoop has been integral in helping organizations engage in big data analytics

Summary

Introduction

For the first time in the history of modern technology, it is not the computers that are changing but the information they process. According to Akamai (2017), several notable events have occurred from a technological perspective. They include increased connection to the Internet all over the world, i.e. increasing Internet penetration across the globe; the smartphone supplanting the personal computer as the primary computing device for most users; the use of social media as a primary method of communication; and the disruption of said communication channel by authorities. It goes on to say that big data has one or several of three critical characteristics, namely high variety, high volume or high velocity. They round up by saying that big data comes from a myriad of sources, including “sensors, devices, video/audio, networks, log files, transactional applications, web, and social media much of it generated in real time and in a very large scale” (IBM, 2017). This report will explore the advent of big data, the challenges that it created, the tools that were developed to deal with the new paradigm, and the problems that still exist in the big data sphere.

Related works
Cassandra
Cloudera
Google Refine
Hadoop
RapidMiner
Importance of the development
Methodologies behind big data
Conclusion
