In this study, we investigated how effectively COVID-19 image categorization using Harr feature extraction and machine learning algorithms. We were particularly interested in the effectiveness of these algorithms. A dataset of 500 X-ray scans, equally split between 250 COVID-19-positive cases and 250 healthy controls, served as the basis for our study. K-nearest neighbors,decision tree, Linear regression, support vector machine, regression, classification, naive Bayes,random forest, as well as linear discriminant analysis were among the seven machine-learning approaches used to categorize the photos. With the use of Harr feature extraction, the features of the pictures were extracted. We studied the efficacy of COVID-19 X-ray images for classification utilizing the combination of machine learning as well as the Harr feature extraction methods in the present investigation due to their effectiveness. We searched a database of 500 X-rays for this investigation, dividing them equally between groups of 250 patients with COVID-19-positive cases and 250 healthy people. Following that, the images were examined using seven various machine learning approaches for recognition. These methods included naive Bayes, linear discriminant analysis, random forests, classification,k-nearest neighbors, and regression trees. The information from the photos was gathered using the Harr feature extraction method. The effectiveness of the algorithms was evaluated with the help of a variety of metrics, such asF1 score, precision,accuracy, recall, the area under the ROC curve, and the region of interest curve. According to our research, the Support Vector Machine algorithm had the highest accuracy, at 77%, while the Naive Bayes approach had the lowest accuracy, at 58%. By using machine learning and Harr feature extraction approaches, the Random Forest method yields the best results, based on our research. The development of future COVID-19 X-ray image-based automated diagnostic systems may be influenced by these findings. Results from the suggested model were comparable to those of cutting-edge models trained using transfer learning techniques. The proposed model's main advantage is that it has ten times fewer parameters than the most advanced models.A receiver operating characteristic (ROC) curve's F1 score, and the algorithms' accuracy, precision, the area under the curve, and recall were all used as metrics. According to our findings, the Naive Bayes method gained the least accuracy (58%) and the Support Vector Machine method produced the highest accuracy (77%) when used. Our results reveal that employing Harr feature extraction and machine learning techniques, the Random Forest strategy is the most successful way to recognize COVID-19 X-ray pictures. These findings may be pertinent to the development of automated COVID-19 diagnosis tools relying on X-ray images. The recommended model produced results that were competitive when measured against cutting-edge models trained using transfer learning techniques. The suggested model employs 10 times fewer parameters than the most advanced models, which is its key selling point.
Read full abstract