Abstract
Data mining is seen as a set of techniques and technologies allowing to extract, automatically or semi-automatically, a lot of useful information, models, and tendencies from a big set of data. Techniques like “clustering,” “classification,” “association,” and “regression”; statistics and Bayesian calculations; or intelligent artificial algorithms like neural networks will be used to extract patterns from data, and the main goal to achieve those patterns will be to explain and to predict their behavior. So, data are the source that becomes relevant information. Research data are gathered as numbers (quantitative data) as well as symbolic values (qualitative data). Useful knowledge is extracted (mined) from a huge amount of data. Such kind of knowledge will allow setting relationships among attributes or data sets, clustering similar data, classifying attribute relationships, and showing information that could be hidden or lost in a vast quantity of data when data mining is not used. Combination of quantitative and qualitative data is the essence of mixed methods: on one hand, a coherent integration of result data interpretation starting from separate analysis, and on the other hand, making data transformation from qualitative to quantitative and 1 vice versa. A study developed shows how data mining techniques can be a very interesting complement to mixed methods, because such techniques can work with qualitative and quantitative data together, obtaining numeric analysis from qualitative data based on Bayesian probability calculation or transforming quantitative into qualitative data using discretization techniques. As a study case, the Psychological Inventory of Sports Performance (IPED) has been mined and decision trees have been developed in order to check any relationships among the “Self-confidence” (AC), “Negative Coping Control” (CAN), “Attention Control” (CAT), “Visuoimaginative Control” (CVI), “Motivational Level” (NM), “Positive Coping Control” (CAP), and “Attitudinal Control” (CACT) factors against gender and age of athletes. These decision trees can also be used for future data predictions or assumptions.
Highlights
IntroductionData Mining Added to Mixed Methods and Cluster Algorithms
Data Mining Added to Mixed Methods and Cluster AlgorithmsData mining is a technique that tries to find behavior patterns in large data sets in order to explain them
We have developed seven decision trees (Figures 1–7), taking IPED factors as a goal to determine the influence of age and gender on IPED scores
Summary
Data Mining Added to Mixed Methods and Cluster Algorithms. Data mining is a technique that tries to find behavior patterns in large data sets in order to explain them. Clustering technique allocates data into subsets that share some characteristics. Elements in each cluster are or have some features similar to the other elements in the cluster, but they are or have some features different to elements. This is a very useful tool because it allows you to find or identify unknown groups that frequently are not identified by humans (Zaki and Meira, 2014; Witten et al, 2016)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.