Abstract

Technological advances have allowed to collect and store large volumes of data over the years. Besides, it is significant that today’s applications have high performance and can analyze these large datasets effectively. Today, it remains a challenge for data mining to make its algorithms and applications equally efficient in the need of increasing data size and dimensionality [1]. To achieve this goal, many applications rely on parallelism, because it is an area that allows the reduction of cost depending on the execution time of the algorithms because it takes advantage of the characteristics of current computer architectures to run several processes concurrently [2]. This paper proposes a parallel version of the FuzzyPred algorithm based on the amount of data that can be processed within each of the processing threads, synchronously and independently.

Highlights

  • FuzzyPred is a data mining method that allows the extraction of fuzzy predicates in normal conjunctive and disjunctive form [3] [4]

  • This paper proposes a parallel version of the FuzzyPred algorithm based on the amount of data that can be processed within each of the processing threads, synchronously and independently

  • Because parallel computing must be exploited to solve data mining problems, this paper presents a parallel version of FuzzyPred with the purpose of reducing runtime

Read more

Summary

Introduction

FuzzyPred is a data mining method that allows the extraction of fuzzy predicates in normal conjunctive and disjunctive form [3] [4]. This method is modeled as a problem of combinatorial optimization because the space of solutions to travel can become very large. Each generated solution (or predicate) is sequentially evaluated in each of the database records. Considering the above, and due to the fact that the dimensions and the number of variables of the current databases increase in size every day, it is possible to obtain high response times in this process by using FuzzyPred [5]. Experiments are performed to compare the sequential version with the parallel version of FuzzyPred, in different performance metrics ( acceleration and efficiency)

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call