Abstract

Data mining is seen as a set of techniques and technologies allowing to extract, automatically or semi-automatically, a lot of useful information, models, and tendencies from a big set of data. Techniques like “clustering,” “classification,” “association,” and “regression”; statistics and Bayesian calculations; or intelligent artificial algorithms like neural networks will be used to extract patterns from data, and the main goal to achieve those patterns will be to explain and to predict their behavior. So, data are the source that becomes relevant information. Research data are gathered as numbers (quantitative data) as well as symbolic values (qualitative data). Useful knowledge is extracted (mined) from a huge amount of data. Such kind of knowledge will allow setting relationships among attributes or data sets, clustering similar data, classifying attribute relationships, and showing information that could be hidden or lost in a vast quantity of data when data mining is not used. Combination of quantitative and qualitative data is the essence of mixed methods: on one hand, a coherent integration of result data interpretation starting from separate analysis, and on the other hand, making data transformation from qualitative to quantitative and 1 vice versa. A study developed shows how data mining techniques can be a very interesting complement to mixed methods, because such techniques can work with qualitative and quantitative data together, obtaining numeric analysis from qualitative data based on Bayesian probability calculation or transforming quantitative into qualitative data using discretization techniques. As a study case, the Psychological Inventory of Sports Performance (IPED) has been mined and decision trees have been developed in order to check any relationships among the “Self-confidence” (AC), “Negative Coping Control” (CAN), “Attention Control” (CAT), “Visuoimaginative Control” (CVI), “Motivational Level” (NM), “Positive Coping Control” (CAP), and “Attitudinal Control” (CACT) factors against gender and age of athletes. These decision trees can also be used for future data predictions or assumptions.

Highlights

  • IntroductionData Mining Added to Mixed Methods and Cluster Algorithms

  • Data Mining Added to Mixed Methods and Cluster AlgorithmsData mining is a technique that tries to find behavior patterns in large data sets in order to explain them

  • We have developed seven decision trees (Figures 1–7), taking IPED factors as a goal to determine the influence of age and gender on IPED scores

Read more

Summary

Introduction

Data Mining Added to Mixed Methods and Cluster Algorithms. Data mining is a technique that tries to find behavior patterns in large data sets in order to explain them. Clustering technique allocates data into subsets that share some characteristics. Elements in each cluster are or have some features similar to the other elements in the cluster, but they are or have some features different to elements. This is a very useful tool because it allows you to find or identify unknown groups that frequently are not identified by humans (Zaki and Meira, 2014; Witten et al, 2016)

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call