Abstract

Microsoft SQL Server is a complete suite of tools that include a relational database management system (RDBMS), multidimensional online analytical processing (OLAP) and tabular database engines, a brokering service, a scheduling service (SQL Agent), and many other features. As discussed in Chapter 1, it has become extremely important these days to integrate data between different sources. The advantage that SQL Server brings is that it offers a powerful Business Intelligence (BI) stack, which provides rich features for data mining and interactive reporting. One of these BI components is an Extract, Transform, and Load (ETL) tool called SQL Server Integration Services (SSIS). ETL is a process to extract data, mostly from different types of systems, transform it into a structure that’s more appropriate for reporting and analysis and finally load it into the database. SSIS, as an ETL tool offers the ability to merge structured and unstructured data by importing Hive data into SQL Server and apply powerful analytics on the integrated data. Throughout the rest of this chapter, you will get a basic lesson on how SSIS works and create a simple SSIS package to import data from Hive to SQL Server.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call