Abstract
With the increase in high volume, velocity, and variety of data, the traditional data analysis approaches are not adequate to handle diverse analysis challenges. Traditionally, a data warehouse is being used which is an integrated repository from various sources used for management and decision-making in business. Data is already in a transformed and structured format stored in a costly but reliable storage device. The data warehouse does not include all the data that may be not required at the time of construction of the data warehouse. With the advent of big data and to handle the data silos problem, the concept of Data Lake is introduced to handle data analysis. Data lakes have not replaced the data warehouse but rather complement it. In this chapter, firstly Data Lake is introduced and compared with predecessor technologies, then various tools and techniques are discussed to implement Data Lake.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.