Abstract

Machine learning (ML) is used globally as a tool for predictive analysis. Within auditing, the use of audit data helps to uncover fraud indicators, identify risk areas and implement predictive models for continuous audit monitoring. Researchers are using various machine learning methods to analyze large and complex audit data to facilitate prediction. In this study, an online UCI dataset of 776 lines and 27 features is used. Out of these 27 features, 13 are eliminated due to their low impact on the target dataset or due to the ‘important feature selection’ algorithm. In this analysis, I used supervised learning methods, namely K-Nearest Neighbors, Logistic Regression, Random Forest, Support Vector Machine, Decision Tree, Linear Discriminant Analysis, Gaussian Naive Bayes, Extra Tree Algorithm, Gradient Boosting Algorithm, Ada Boosting Algorithm and XGBoost Algorithms. The experimental results highlight the power of eight neighbor KNN and evaluate its effectiveness, sensitivity, precision, accuracy and F1 score in comparison with other methods such as Naive Bayes, SVM (Linear Kernel), Decision Tree Classifier and Random Forest Classifier.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.