Abstract

As a result of the huge amounts of data available in any subject of research, data mining technologies confront considerable obstacles. Due to the large volume of data, most existing data mining methods are inapplicable to many real-world problems. Data mining methods become ineffective when the problem size becomes too large. Scalability is an issue that must be addressed in data mining algorithms in order to construct high-performance, efficient, and scalable data mining algorithms. A new scalability technique is proposed and applied to several data mining problems in this research. The proposed scaling methodology is developed using a cascade approach. The approach begins with the collection of a large data set from several sources, which is then preprocessed. Once the dataset has been preprocessed, decompose it into smaller data sets of equal size subsets. Then apply a data mining approach to each subset, with the identical data mining method stated for each subset. The findings of the data mining approach on all subsets are pooled and aggregated for the final output. The performance of the suggested algorithm is assessed using a number of criteria, including accuracy, precision, recall, F-score, and execution time. Social advertising and bank marketing are the two datasets on which the proposed method is tested. The suggested approach's performance is compared with non-scale data mining methods, and it is found that the scaling method outperformed non-scale data mining methods on all measures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.