A study on sex prediction by using machine algorithms with anthropometric measurements of the seventh cervical vertebra.
Prediction of sex is among important topics of forensic medicine and forensic anthropology. In studies conducted for sex prediction, pelvis and cranium bones are the most preferred bones. In cases when it is difficult to examine the pelvis and cranium bones, vertebrae have been the subject of research in sex analysis studies. The aim of this study is to predict sex by using Computed Tomography (CT) images of the vertebra prominens (C7). Another aim of the study is to make automatic measurements using labeling on C7. This retrospective study included images of 100 female and 100 male individuals (aged 2050 years). CT Images on the personal workstation (Horos Project, Version 3.0) were made orthogonal in the entire plane. They were transferred to the Sekazu program in DICOM format. The labels of the bookmarks determined on C7 were placed on the images by the Radiologist and Anatomist according to their coordinates. Then, automatic measurements were performed in the program and calculations were made. Optimization of the study was achieved by automatic measurements, thus eliminating the effects of intra-observer and/or inter-observer measurement errors. Sixteen length and 3 angle parameters were analysed by using machine learning (ML) algorithms. The accuracy rates in sex prediction using ML algorithms with the parameters obtained as a result of the analysis are as follows: Ada Boost Classification 8791%, Decision Tree 8592%, Extra Trees Classifier 8793%, Gradient Boosting Model 8591%, Gaussian Naive Bayes 8791%, Gaussian Process Classifier 8191%, K-nearest Neighbour Regression 8493%, Linear Discriminant Analysis 8894%, Linear Support Vector Classification 8892%, Non-Linear Support Vector Classification 8393%, Quadratic Discriminant Analysis 8790%, Random Forest 8392%, Support Vector Machines 8492%. In this study, it was predicted that sex prediction could be made up to 94% using ML algorithms from the parameters of vertebra prominens, which is an atypical vertebra. Therefore, we can say that vertebra prominens also shows sexual dimorphism.
- Research Article
38
- 10.1371/journal.pone.0301541
- Apr 18, 2024
- PLOS ONE
Many individual studies in the literature observed the superiority of tree-based machine learning (ML) algorithms. However, the current body of literature lacks statistical validation of this superiority. This study addresses this gap by employing five ML algorithms on 200 open-access datasets from a wide range of research contexts to statistically confirm the superiority of tree-based ML algorithms over their counterparts. Specifically, it examines two tree-based ML (Decision tree and Random forest) and three non-tree-based ML (Support vector machine, Logistic regression and k-nearest neighbour) algorithms. Results from paired-sample t-tests show that both tree-based ML algorithms reveal better performance than each non-tree-based ML algorithm for the four ML performance measures (accuracy, precision, recall and F1 score) considered in this study, each at p<0.001 significance level. This performance superiority is consistent across both the model development and test phases. This study also used paired-sample t-tests for the subsets of the research datasets from disease prediction (66) and university-ranking (50) research contexts for further validation. The observed superiority of the tree-based ML algorithms remains valid for these subsets. Tree-based ML algorithms significantly outperformed non-tree-based algorithms for these two research contexts for all four performance measures. We discuss the research implications of these findings in detail in this article.
- Research Article
5
- 10.1080/23279095.2024.2382823
- Jul 31, 2024
- Applied Neuropsychology: Adult
The cognitive impairment known as dementia affects millions of individuals throughout the globe. The use of machine learning (ML) and deep learning (DL) algorithms has shown great promise as a means of early identification and treatment of dementia. Dementias such as Alzheimer’s Dementia, frontotemporal dementia, Lewy body dementia, and vascular dementia are all discussed in this article, along with a literature review on using ML algorithms in their diagnosis. Different ML algorithms, such as support vector machines, artificial neural networks, decision trees, and random forests, are compared and contrasted, along with their benefits and drawbacks. As discussed in this article, accurate ML models may be achieved by carefully considering feature selection and data preparation. We also discuss how ML algorithms can predict disease progression and patient responses to therapy. However, overreliance on ML and DL technologies should be avoided without further proof. It’s important to note that these technologies are meant to assist in diagnosis but should not be used as the sole criteria for a final diagnosis. The research implies that ML algorithms may help increase the precision with which dementia is diagnosed, especially in its early stages. The efficacy of ML and DL algorithms in clinical contexts must be verified, and ethical issues around the use of personal data must be addressed, but this requires more study.
- Research Article
- 10.55948/ijermca.2024.0710
- Jan 1, 2024
- International Journal of Enhanced Research in Management & Computer Applications
Heart disease is one of the most dangerous and largest killers in the world. Identifying heart disease early has a vast positive effect on patient outcomes and their quality of life. In this research, we try to identify heart disease using machine learning (ML) algorithms. ML algorithms have the highest probability of success if they work on a data set with extensive and diverse information about the given problem - in this case heart disease. There are multiple types of ML algorithms to test, so we can try many different ones on the data, making the results more precise. Even if there are many different algorithms to test, each machine-learning solution can yield different results depending on the dataset used and our target goals. The main goal of this study is to find the differences between individual ML algorithms being used in our specific case: which ML algorithm, or combination of algorithms, is appropriate to detect heart disease with high accuracy? The ML algorithms used in this research are the Naive Bayes Classifier, the Random Forest classifier, and the Support Vector Machine (SVM) algorithm. Furthermore, this study generates insights into these ML algorithms – if a particular algorithm’s model performs better than another on the dataset, analyzing this difference can help us understand what makes the model more suitable for diagnostic screening. Changing models’ hyperparameters or their pre-processing techniques allows for a more robust and reliable model that can be readily incorporated into a healthcare environment.
- Research Article
2
- 10.1093/eurheartj/ehab849.176
- Feb 4, 2022
- European Heart Journal
Funding Acknowledgements Type of funding sources: Public grant(s) – National budget only. Main funding source(s): TECHNOLOGY DEVELOPMENT FUND 1 Background Diabetes has become a major public health concern in Asia. In Malaysia, the prevalence of diabetes has escalated in adults above the age of 18, affecting 3.9 million individuals. Patients with diabetes and coronary heart disease have worse outcomes, compared with patients without diabetes who have coronary heart disease. Conventional Risk scores such as TIMI and GRACE were derived from a Western Caucasian cohort with limited data from Asian countries, despite Asia hosting 60% of the world’s population. Purpose It is important to recognize the significant features associated with in-hospital mortality risk that is population-specific in Asian diabetes patients with STEMI to achieve a reliable and effective clinical diagnosis and improved outcome. Electronic health records contain large amounts of information on patients’ medical history and are becoming invaluable research tools that could be applied to cardiovascular disease risk prediction through machine learning (ML) algorithms. With the current success of ML over conventional methods in STEMI mortality prediction, we aim to develop ML algorithms for in-hospital risk mortality in Asian patients diagnosed with DM that can be adopted for clinical predictions Methods We used registry data from the Malaysian National Cardiovascular Disease Database of 5783 patients diagnosed with DM from 2006 to 2016. Fifty parameters including demographics, cardiovascular risk, medications and clinical variables were considered. Four machine learning (ML) algorithms were constructed using a 70% registry dataset; Random Forest (RF), Support Vector Machine (SVM), Extreme Gradient Booster (XGB) and Logistic Regression (LR). Feature selections were done based on ML algorithms feature importance combined with Sequential Backward Selection (SBS). The area under the curve (AUC) was used as the performance evaluation metric. All algorithms were validated using a 30 % validation dataset and compared to the conventional TIMI risk score for STEMI. Results The best model SVM (AUC = 0.90) outperformed other ML algorithms (Figure 1) and TIMI risk score (AUC = 0.83). The best SVM model consists of 11 predictors which are Killip class, fasting blood glucose, age, systolic blood pressure, heart rate, ACE inhibitor, beta-blocker, total cholesterol, diastolic blood pressure, lower density lipoprotein, and diuretic (Figure 2). Common predictors of SVM and TIMI risk score are Killip class, age, systolic blood pressure, and heart rate. We have shown that the population-specific data mining approach for the prediction of diabetes patients’ mortality post-STEMI outperformed conventional TIMI risk score. Conclusion In the Asian multiethnic population, combination of ML approaches with features selection demonstrated promising outcomes in patients with DM that may be used for better patient prognostic than the conventional method. Abstract Figure 1: ML Best Model Performance Abstract Figure 2: Selected Predictors for ML
- Research Article
321
- 10.1038/s41598-020-72685-1
- Sep 29, 2020
- Scientific Reports
Several machine learning (ML) algorithms have been increasingly utilized for cardiovascular disease prediction. We aim to assess and summarize the overall predictive ability of ML algorithms in cardiovascular diseases. A comprehensive search strategy was designed and executed within the MEDLINE, Embase, and Scopus databases from database inception through March 15, 2019. The primary outcome was a composite of the predictive ability of ML algorithms of coronary artery disease, heart failure, stroke, and cardiac arrhythmias. Of 344 total studies identified, 103 cohorts, with a total of 3,377,318 individuals, met our inclusion criteria. For the prediction of coronary artery disease, boosting algorithms had a pooled area under the curve (AUC) of 0.88 (95% CI 0.84–0.91), and custom-built algorithms had a pooled AUC of 0.93 (95% CI 0.85–0.97). For the prediction of stroke, support vector machine (SVM) algorithms had a pooled AUC of 0.92 (95% CI 0.81–0.97), boosting algorithms had a pooled AUC of 0.91 (95% CI 0.81–0.96), and convolutional neural network (CNN) algorithms had a pooled AUC of 0.90 (95% CI 0.83–0.95). Although inadequate studies for each algorithm for meta-analytic methodology for both heart failure and cardiac arrhythmias because the confidence intervals overlap between different methods, showing no difference, SVM may outperform other algorithms in these areas. The predictive ability of ML algorithms in cardiovascular diseases is promising, particularly SVM and boosting algorithms. However, there is heterogeneity among ML algorithms in terms of multiple parameters. This information may assist clinicians in how to interpret data and implement optimal algorithms for their dataset.
- Research Article
180
- 10.1016/j.tust.2020.103383
- Mar 20, 2020
- Tunnelling and Underground Space Technology
Hybrid meta-heuristic and machine learning algorithms for tunneling-induced settlement prediction: A comparative study
- Research Article
10
- 10.3389/fnut.2022.740898
- Feb 17, 2022
- Frontiers in Nutrition
Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. A large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women was used for the purpose of this study. Adequate VF consumption, which was defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared to two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the predictive ML algorithms was tested after the implementation of adjustments, including normalizing the data, as well as in a series of sensitivity analyses such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with an accuracy of 0.64 (95% confidence interval [CI]: 0.58–0.70) and 0.64 (95%CI: 0.60–0.68) respectively. Among the ML algorithms tested, the most accurate algorithms to predict adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95%CI: 0.59–0.71). The least accurate ML algorithm was the SVM with a linear kernel with an accuracy of 0.55 (95%CI: 0.49–0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms. In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to explore further the true potential of ML in predicting dietary behaviours that are determined by complex interactions among several individual, social and environmental factors.
- Research Article
88
- 10.1016/j.atmosres.2015.09.021
- Oct 8, 2015
- Atmospheric Research
Comparison of four machine learning algorithms for their applicability in satellite-based optical rainfall retrievals
- Research Article
5
- 10.1515/comp-2020-0222
- Mar 10, 2022
- Open Computer Science
Millions of people across the world are suffering from diabetic retinopathy. This disease majorly affects the retina of the eye, and if not identified priorly causes permanent blindness. Hence, detecting diabetic retinopathy at an early stage is very important to safeguard people from blindness. Several machine learning (ML) algorithms are implemented on the dataset of diabetic retinopathy available in the UCI ML repository to detect the symptoms of diabetic retinopathy. But, most of those algorithms are implemented individually. Hence, this article proposes an effective integrated ML approach that uses the support vector machine (SVM), principal component analysis (PCA), and moth-flame optimization techniques. Initially, the ML algorithms decision tree (DT), SVM, random forest (RF), and Naïve Bayes (NB) are applied to the diabetic retinopathy dataset. Among these, the SVM algorithm is outperformed with an average of 76.96% performance. Later, all the aforementioned ML algorithms are implemented by integrating the PCA technique to reduce the dimensions of the dataset. After integrating PCA, it is noticed that the performance of the algorithms NB, RF, and SVM is reduced dramatically; on the contrary, the performance of DT is increased. To improve the performance of ML algorithms, the moth-flame optimization technique is integrated with SVM and PCA. This proposed approach is outperformed with an average of 85.61% performance among all the other considered ML algorithms, and the classification of class labels is achieved correctly.
- Research Article
7
- 10.1002/cpe.8021
- Jan 23, 2024
- Concurrency and Computation: Practice and Experience
SummarySoftware defined network (SDN) has emerged as a new paradigm in terms of network architecture, providing flexibility, agility, and programmability to network management. These benefits boosted the SDN adoption, bringing new challenges mainly related to security, in particular, those related to Denial of Service (DoS) and Distributed Denial of Service (DDoS) attacks. The detection, prevention, and mitigation of these attacks are important since they can affect the entire network. Many current security measures use statistical techniques, as entropy, or machine learning (ML) algorithms to detect DoS and DDoS attacks. While the definition of a threshold to determine whether a traffic is an attack is not trivial in statistical techniques, ML solutions may provide better accuracy but require considerable computational resources and time to converge to a model able to detect these attacks. Trying to circumvent these limitations, current hybrid approaches either use the results from entropy as input in ML algorithms (EntropyML) or use entropy as a filter and ML algorithms to identify attacks. This work goes one step ahead and combines these techniques in a three‐step approach (EntropyMLEntropy), called ML‐Entropy, which inherits the intelligence of ML algorithms to adjust the threshold used by entropy. The proposed solution was implemented and evaluated in two datasets, the well‐known synthetic DARPA dataset and a dataset composed by traffic collected from a real‐corporate environment. Experimental results show that, in general, ML‐Entropy presents an accuracy above 99%, similar to support vector machine (SVC) and random forest (RF) algorithms, being able to converge to a detection model up to and faster than RF and SVC, respectively.
- Research Article
11
- 10.1016/j.heliyon.2023.e18186
- Jul 1, 2023
- Heliyon
Machine learning algorithms for predicting the risk of fracture in patients with diabetes in China
- Research Article
139
- 10.1016/j.jhydrol.2021.125969
- Jan 12, 2021
- Journal of Hydrology
Merging multiple satellite-based precipitation products and gauge observations using a novel double machine learning approach
- Research Article
16
- 10.31557/apjcp.2022.23.10.3287
- Oct 1, 2022
- Asian Pacific Journal of Cancer Prevention
To identify which Machine Learning (ML) algorithms are the most successful in predicting and diagnosing breast cancer according to accuracy rates. The "College of Wisconsin Breast Cancer Dataset", which consists of 569 data and 30 features, was classified using Support Vector Machine (SVM), Naive Bayes (NB), Random Forest (RF), Decision Tree (DT), K-Nearest Neighbor (KNN), Logistic Regression (LR), Multilayer Perceptron (MLP), Linear Discriminant Analysis (LDA), XgBoost (XGB), Ada-Boost (ABC) and Gradient Boosting (GBC) ML algorithms. Before the classification process, the dataset was preprocessed. Sensitivity, accuracy, and definiteness metrics were used to measure the success of the methods. Compared to other ML algorithms used in the study, the GBC ML algorithm was found to be the most successful method in the classification of tumors with an accuracy of 99.12%. The XGB ML algorithm was found to be the lowest method with an accuracy rate of 88.10%. In addition, it was determined that the general accuracy rates of the 11 ML algorithms used in the study varied between 88-95%. When the results obtained from the ML classifiers used in the study are evaluated, the efficiency of the GBC algorithm in the classification of tumors is obvious. It can be said that the success rates obtained from 11 different ML algorithms used in the study are valuable in terms of being used to predict different cancer types.
- Research Article
8
- 10.1007/s42452-020-04035-9
- Jan 1, 2021
- SN Applied Sciences
Brain-computer interface (BCI) is believed to be the translator of brain signals into actions based on the model, built on the machine learning (ML) algorithms, incorporated in it. This study reports on the performance of various ML algorithms in evaluating efficacy of neurofeedback applied for treatment of central neuropathic pain (CNP). In the first phase of this study, we applied different ML algorithms for classification of electroencephalography (EEG) patterns, associated with CNP, obtained from three groups of participants, during imagined movement of their limbs, named as able-bodied (AB), paraplegic patients with (PWP) and without (PNP) neuropathic pain. In the second phase, we tested the accuracy of BCI-classifier by applying new EEG data obtained from PWP participants who have completed neurofeedback training provided for the management of pain. Support vector Machine (SVM) algorithm gained higher accuracy, with all groups, than the other classifiers. However, the highest classification accuracy of 99 ± 0.49% was obtained with the right hand motor imagery of (AB vs PWP) group and 61 electrodes. In Conclusion, SVM based BCI-classifier achieved high accuracy in evaluating efficacy of neurofeedback applied for treatment of CNP. Results of this study show that the accuracy of BCI changes with ML algorithm, electrodes combinations, and training data set.
- Research Article
1
- 10.17762/turcomat.v12i10.5025
- Apr 28, 2021
There is an enormous amount of data being dealt with by the medical field on a daily basis. Using a conventional method for handling data can affect the accuracy of the results. Early recognition of the disease is crucial for the analysis of patient medicines and specialists. The objective of this paper is to provide a comprehensive review of the techniques used in disease detection. Machine learning algorithms can be used to find out facts in medical research, particularly disease prediction. Machine learning algorithms such as Support vector machine [SVM], Decision trees, Bayes classifiers, K-Nearest Neighbours [KNN] Ensemble classifier techniques, etc. are used to determine different ailments. The use of machine learning algorithms can lead to fast and high accuracy prediction of diseases. This research paper analyses how machine learning techniques and algorithms are used to predict different diseases and their types. This paper provides an extensive survey of the machine learning techniques used for the prediction of chronic kidney disease, liver disease, haematological diseases, Alzheimer’s disease, and urinary tract infections.
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.