WEKA Tool Research Articles

Human identification of unknown samples following disaster and mass casualty events is essential, especially to bring closure to family and friends of the deceased. Unfortunately, victim identification is often challenging for forensic investigators as analysis becomes complicated when biological samples are degraded or of poor quality as a result of exposure to harsh environmental factors. Mitochondrial DNA becomes the ideal option for analysis, particularly for determining the origin of the samples. In such events, the estimation of genetic parameters plays an important role in modelling and predicting genetic relatedness and is useful in assigning unknown individuals to an ethnic group. Various techniques exist for the estimation of genetic relatedness, but the use of Machine learning (ML) algorithms are novel and presently the least used in forensic genetic studies. In this study, we investigated the ability of ML algorithms to predict genetic relatedness using hypervariable region I sequences; that were retrieved from the GenBank database for three race groups, namely African, Asian and Caucasian. Four ML classification algorithms; Support vector machines (SVM), Linear discriminant analysis (LDA), Quadratic discriminant analysis (QDA) and Random Forest (RF) were hybridised with one-hot encoding, Principal component analysis (PCA) and Bags of Words (BoW), and were compared for inferring genetic relatedness. The findings from this study on WEKA showed that genetic inferences based on PCA-SVM achieved an overall accuracy of 80–90% and consistently outperformed PCA-LDA, PCA-RF and PCA-QDA, while in Python BoW-PCA-RF achieved 94.4% accuracy which outperformed BoW-PCA-SVM, BoW-PCA-LDA and BoW-PCA-QDA respectively. ML results from the use of WEKA and Python software tools displayed higher accuracies as compared to the Analysis of molecular variance results. Given the results, SVM and RF algorithms are likely to also be useful in other sequence classification applications, making it a promising tool in genetics and forensic science. The study provides evidence that ML can be utilized as a supplementary tool for forensic genetics casework analysis.

Read full abstract

With the rapid development of computer technology, information technology covers all aspects of daily life, and the medical industry is also paying more attention to information construction. Conventional management methods have been unable to further improve the hospital’s management capabilities. At the same time, countries that are better in terms of hospital management practices have set a benchmark for mainland hospitals and reformed hospitals in order to stand out in the future. In addition to evaluating the economic benefits and work efficiency of doctors, hospitals must also consider that hospitals, as a special service industry, cannot be measured by economic indicators. Therefore, there is a multiparty game in the performance appraisal of hospitals, and it is necessary to consider not only economic factors but also the characteristics of public services. This article is based on the case of a large domestic tertiary hospital, combined with the hospital’s performance management reform plan, through the design idea of performance management and incentive performance pay distribution, using data mining technology as an auxiliary means. It successfully helped the hospital complete the performance and incentive performance pay aspects reform. The main research work of this paper is divided into the following three aspects. (1) Using data mining technology, according to each nursing unit’s workload, risk level, the difficulty of internship, and other objective factors in the past year for patient outpatient visits, surgery implementation, critical first aid, etc., are classified in line with the actual situation and provide a reliable basis for the reasonable and efficient allocation of hospital human resources. (2) In the performance management system, we integrate the third-party data mining tool weka to assist in the evaluation of the performance distribution plan and the calculation of the follow-up incentive performance pay. (3) We use the mathematical model of data mining to measure and evaluate the reasonableness of historical workload and performance appraisal, determine a new incentive performance pay distribution model, and use the software as a calculation tool for the internal distribution of performance wages to provide monthly incentive performance wage statistics in the future.

Read full abstract

WEKA Tool Research Articles

Related Topics

Articles published on WEKA Tool

Performance Analysis of Soil Health Classifiers Using Data Analytics Tools and Techniques for Best Model and Tool Selection

Penerapan Clustering K-Means untuk Pengelompokan Tingkat Kepuasan Pengguna Lulusan Perguruan Tinggi

Implementasi Algoritma Data Mining J48 Untuk Klasifikasi Mahasiswa Yang Layak Mendapat Beasiswa PPA

Usage of Machine Learning Algorithm Models to Predict Operational Efficiency Performance of Selected Banking Sectors of India

Fast and accurate classifying model for denial-of-service attacks by using machine learning

Detection and Investigation of DDoS Attacks in Network Traffic using Machine Learning Algorithms

Classification Model for Hepatitis B Disease Using Supervised Machine Learning Technique

Towards a classification of sustainable software development process using manifold machine learning techniques

Evaluating the Performance of Supervised Machine Learning Algorithms in Breast Cancer Datasets

Diagnosis of Breast Cancer Pathology on the Wisconsin Dataset with the Help of Data Mining Classification and Clustering Techniques.

Peningkatan Performa Klasifikasi Machine Learning Melalui Perbandingan Metode Machine Learning dan Peningkatan Dataset

The application of machine learning to predict genetic relatedness using human mtDNA hypervariable region I sequences.

IMPLEMENTASI DATA MINING MENGGUNAKAN METODE NAIVE BAYES DENGAN FEATURE SELECTION UNTUK PREDIKSI KELULUSAN MAHASISWA TEPAT WAKTU

Application of Data Mining in Performance Management of Public Hospitals

Comparative analysis of HAR datasets using classification algorithms

Comparative Study for Prediction of Low and High Plasma Protein Binding Drugs by Various Machine Learning-Based Classification Algorithms

Study and Application of Industrial Thermal Comfort Parameters by Using Bayesian Inference Techniques

Using Decision Trees to Predict Critical Reading Performance.

Security Analysis of DDoS Attacks Using Machine Learning Algorithms in Networks Traffic

IMPROVING STUDENTS PERFORMANCE PREDICTION USING MACHINE LEARNING AND SYNTHETIC MINORITY OVERSAMPLING TECHNIQUE

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

WEKA Tool Research Articles

Related Topics

Articles published on WEKA Tool

Performance Analysis of Soil Health Classifiers Using Data Analytics Tools and Techniques for Best Model and Tool Selection

Penerapan Clustering K-Means untuk Pengelompokan Tingkat Kepuasan Pengguna Lulusan Perguruan Tinggi

Implementasi Algoritma Data Mining J48 Untuk Klasifikasi Mahasiswa Yang Layak Mendapat Beasiswa PPA

Usage of Machine Learning Algorithm Models to Predict Operational Efficiency Performance of Selected Banking Sectors of India

Fast and accurate classifying model for denial-of-service attacks by using machine learning

Detection and Investigation of DDoS Attacks in Network Traffic using Machine Learning Algorithms

Classification Model for Hepatitis B Disease Using Supervised Machine Learning Technique

Towards a classification of sustainable software development process using manifold machine learning techniques

Evaluating the Performance of Supervised Machine Learning Algorithms in Breast Cancer Datasets

Diagnosis of Breast Cancer Pathology on the Wisconsin Dataset with the Help of Data Mining Classification and Clustering Techniques.

Peningkatan Performa Klasifikasi Machine Learning Melalui Perbandingan Metode Machine Learning dan Peningkatan Dataset

The application of machine learning to predict genetic relatedness using human mtDNA hypervariable region I sequences.

IMPLEMENTASI DATA MINING MENGGUNAKAN METODE NAIVE BAYES DENGAN FEATURE SELECTION UNTUK PREDIKSI KELULUSAN MAHASISWA TEPAT WAKTU

Application of Data Mining in Performance Management of Public Hospitals

Comparative analysis of HAR datasets using classification algorithms

Comparative Study for Prediction of Low and High Plasma Protein Binding Drugs by Various Machine Learning-Based Classification Algorithms

Study and Application of Industrial Thermal Comfort Parameters by Using Bayesian Inference Techniques

Using Decision Trees to Predict Critical Reading Performance.

Security Analysis of DDoS Attacks Using Machine Learning Algorithms in Networks Traffic

IMPROVING STUDENTS PERFORMANCE PREDICTION USING MACHINE LEARNING AND SYNTHETIC MINORITY OVERSAMPLING TECHNIQUE