Abstract
In this study, the aim was to assess the effect and significance of hyperparameters in four different datasets containing different values for observation numbers and variable counts with the machine-learning methods of support vector machines and artificial neural networks. With this aim, a dataset comprising 15 repeats of 77 protein levels from 38 healthy and 34 down syndrome mice was used. A total of 138 different models and model classification performance criteria were obtained from the datasets in the study comprising combinations of hyperparameters in machine-learning methods. Comparison of the models used criteria like accurate classification percentage, kappa statistic, mean absolute error and square root of mean error squares. According to performance criteria, the first dataset with 1080 observations x 77 variables had 71.30% accurate classification percentage for assumed parameters with the support vector machines polynomial kernel function, while changing the hyperparameter variables increased this rate to 99.44%. Similarly, the second dataset had 50.65% accurate classification percentage with the artificial neural network single hidden layer 2 neuron model, while changing the hyperparameter values increased this rate to 90.46%. In conclusion, in situations with low variable and observation numbers, the machine learning methods were determined to display lower performance. However, in datasets, it is very important for classification performance in artificial neural networks and support vector machines, especially polynomial and radial basis function kernel functions, to set hyperparameters according to the dataset. In situations with low variable numbers, especially, the effect of hyperparameters was determined to gain importance.
Highlights
With machine learning methods algorithms developed for complicated and large data can perform classifications with a high degree of accuracy [1]
It has been stated in many studies [10,11,12,13] on machine learning that artificial neural networks and support vector machines are very successful methods for classification and estimation, as well as the presence of more than one hyperparameter in methods has been effective in the preference of methods in the Copyright © European Journal of Technique (EJT)
According to the results obtained in our study; - For all datasets, especially in polynomial and RBF kernel function support vector machines and artificial neural networks, the arrangement of hyperparameters according to the dataset is very important for classification performance, - In situations with low numbers of variables and observations, machine learning methods displayed lower performance, - In situations with a low number of variables, the effect of hyperparameters can be said to gain greater importance
Summary
With machine learning methods algorithms developed for complicated and large data can perform classifications with a high degree of accuracy [1]. Machine learning algorithms require adjustment before operation and include ‘hyperparameters’ with no clear defaults acceptable across a wide range of applications Examples of these hyperparameters that require determination for these algorithms are the depth of a decision tree, number of trees in the forest and number of neurons in each layer of an artificial neural network. These parameters have critical importance for machine learning because different hyperparameters generally result in performances with a significant degree of difference [3]. It has been stated in many studies [10,11,12,13] on machine learning that artificial neural networks and support vector machines are very successful methods for classification and estimation, as well as the presence of more than one hyperparameter in methods has been effective in the preference of methods in the Copyright © European Journal of Technique (EJT)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.