Abstract

Parallel processing seems to be the great hope to speed up and scale up data mining algorithms, in order to cope with the huge size of real-world databases and data warehouses. However, most projects on parallel data mining have focused on the paralleilization of a single kind of algorithm or knowledge discovery paradigm. This tutorial will present a considerably broader view of the area of parallel data mining. In particular, it will discuss the parallelization of algorithms of four different knowledge discovery paradigms, namely rule induction, instance-based learning (or nearest neighbours), genetic algorithms and neutral networks. In addition, this tutorial will address both the use of “general- purpose” parallel machines and the use of commercially-available parallel database servers. Different parallelization strategies will be discussed and compared, for each of the four above- mentioned knowledge discovery paradigms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.