Abstract

Big data repositories are increasingly massive and distributed. To manage them and make their contents useful, we need smart data analysis techniques and scalable architectures that extract valuable information in reduced time. Cloud computing infrastructures offer effective support for both the computational and data storage needs of big data mining and parallel knowledge discovery applications. Complex data mining tasks involve data- and compute-intensive algorithms that require large, efficient storage facilities together with high-performance processors to produce results in acceptable times. We address the main topics and research issues in efficiently using Cloud computing platforms to implement big data mining applications on large data sets. We present data mining techniques and frameworks designed for developing distributed data analytics applications on Clouds. These systems implement data set storage, analysis tools, data mining algorithms, and knowledge models as single services that are combined through a visual programming interface into distributed workflows.
