Abstract

Data extraction is often a dynamic process that can be easily modelled as a workflow for data processing. When massive collections of data have to be evaluated and/or sophisticated data mining algorithms have to be performed, it can take very long to execute data analysis workflows. Effective technologies are also needed to incorporate flexible data collection workflows through the use of cloud-based storage platforms, where data is stored even more regularly. The paper attempts to show how cloud infrastructure is implemented to introduce an optimised framework in which scalable data analyzation workflows can be planned and performed. We explain how the Data Mining Cloud Architecture is built and applied and a data analytics method that incorporates visual workflow vocabulary, parallel to the Virtualized environment. DMCF is developed with a view to simplifying the creation of applications for data mining associated with generic system monitoring schemes that are not created especially for this area, in view of the specifications of actual data mining applications. The effects are a high-level environment that minimises the programming effort with an optimised visual workflow language, allowing the implementation of typical patterns meant to generate and execute data mining application in parallel simple to professional developers. The wall mounted of the workflow, device design and mechanisms of the DMCF are shown. We also address many DMCF-developed data mining business processes and the scalability achieved by running business processes in a cloud environment.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call