Abstract

Introduction: Hepatitis C is a chronic infection caused by hepatitis c virus - a blood borne virus. Therefore, the infection occurs through exposure to small quantities of blood. It has been estimated by World Health Organization (WHO) to have affected 71 million people worldwide. This infection costs individual, groups and government a lot because no vaccine has been gotten yet for the treatment. This disease is likely to continue to affect more people because it’s long asymptotic phase which makes its early detection not feasible.Material and Methods: In this study, we have presented machine learning models to automatically classify the diagnosis test of hepatitis and also ranked the test features in order to know how they contribute to the classification which help in decision making process by the health care industry. The synthetic minority oversampling technique (SMOTE) was used to solve the problem of imbalance dataset.Results: The models were evaluated based on metrics such as Matthews correlation coefficient, F-measure, Precision-Recall curve and Receiver Operating Characteristic Area Under Curve. We found that using SMOTE techniques helped raise performance of the predictive models. Also, random forest (RF) had the best performance based on Matthews correlation coefficient (0.99), F-measure (0.99), Precision-Recall curve (1.00) and Receiver Operating Characteristic Area Under Curve (0.99).Conclusion: This discovery has the potential to impact on clinical practice, when health workers aim at classifying diagnosis result of disease at its early stage.

Highlights

  • Hepatitis C is a chronic infection caused by hepatitis c virus a blood borne virus

  • We found that using Synthetic Minority Oversampling Technique (SMOTE) techniques helped raise performance of the predictive models

  • This could last for decades after which the secondary symptoms begin to show up which leads to liver damage [4]. Another challenge is the diagnostic classification of Machine Learning Models for Diagnostic Classification of Hepatitis C Tests the hepatitis tests which is done after a person have been diagnosed with Hepatitis C Virus (HCV) infection, this would aid in determining the degree of liver damage [4], which we intend to propose automated methods to analyze laboratory tests results that will help to make timely decisions

Read more

Summary

Introduction

The infection occurs through exposure to small quantities of blood It has been estimated by World Health Organization (WHO) to have affected 71 million people worldwide. This infection costs individual, groups and government a lot because no vaccine has been gotten yet for the treatment. WHO affirms that a noteworthy number of individuals infected will develop cirrhosis or liver cancer This hepatitis C virus is a blood borne virus; the most common ways of infection is through exposure to small quantities of blood. Diagnosis of HCV is of great importance; but the major challenge of new HCV infections is asymptomatic in nature [4] This could last for decades after which the secondary symptoms begin to show up which leads to liver damage [4]. Another challenge is the diagnostic classification of Machine Learning Models for Diagnostic Classification of Hepatitis C Tests the hepatitis tests which is done after a person have been diagnosed with HCV infection, this would aid in determining the degree of liver damage [4], which we intend to propose automated methods to analyze laboratory tests results that will help to make timely decisions

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.