Abstract

To be competitive, companies need to be able to take advantage of the huge amounts of data, called also Big Data deluge, to predict what might happen in the future. In this way, predictive analytics play an important role for extracting useful information which may extend the business strategy and so gain competitive advantages. Predictive analytics involve data mining algorithms to discover knowledge from huge volumes of data. In this context, Association Rules (ARs) mining is considered as one of the most wide-spread data mining techniques. It is especially based on frequent itemsets mining process. However, when it comes to Big Data, ARs mining algorithms produce a huge amount of ARs, many of which are redundant and unuseful. To overcome this drawback, we propose a ontology-driven Map-Reduce Framework for ARs mining in massive data. Ontologies allow to filter the generated ARs and keep only useful ones. The filtering process is assured by a semantic pruning phase introduced in the Map-Reduce jobs in order to eliminate unuseful candidates from the computing of the Maximal Frequent Itemsets (MFI). This may allow a quantitative and especially qualitative reduction of the number of MFI and subsequently of the ARs. Extensive experiments on several datasets demonstrate the ability to handle massive data for mining ARs.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.