Abstract

This chapter discusses the main models and techniques used for designing Cloud-based data analysis applications. The models presented here are based on MapReduce, workflows, and NoSQL database management systems. In the further sections, how each one of these three main approaches offer scalability for mining Big Data repository on Clouds, has been explained. Section 3.1 introduces the MapReduce model and how it can be used to implement scalable data analysis algorithms and applications. Section 3.2 discusses the workflow systems, presents some workflow management systems implemented on Cloud architectures, and discusses their main features to implement data analysis applications. Finally, Section 3.3 describes NoSQL database systems that were recently developed to efficiently manage large volumes of data. In several application cases, NoSQL databases are more scalable and provide higher performance than relational databases. Here we describe some representative NoSQL systems, and discuss use cases for NoSQL databases with a focus on data analytics.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call