Abstract

Information is considered to be the major part of an organization. With the enhancement of technology, the knowledge level is increasing with the passage of time. This increase of information is in volume, velocity, and variety. Extracting meaningful insights is the dire need of an individual from such information and knowledge. Visualization is a key tool and has become one of the most significant platforms for interpreting, extracting, and communicating information. The current study is an endeavour toward data modelling and user knowledge by using a rough set approach for extracting meaningful insights. The technique has used different rough set algorithms such as K-nearest neighbours (KNN), decision rules (DR), decomposition tree (DT), and local transfer function classifier (LTF-C) for an experimental setup. The approach has found its accuracy for the optimal use of data modelling and user knowledge. The experimental setup of the proposed method is validated by using the dataset available in the UCI web repository. Results of the proposed study show that the model is effective and efficient with an accuracy of 96% for KNN, 87% for decision rules, 91% for decision trees, 85.04% for cross validation architecture, and 94.3% for local transfer function classifier. The validity of the proposed classification algorithms is tested using different performance metrics such as F-score, precision, accuracy, recall, specificity, and misclassification rates. For all these performance metrics, the KNN classifier outperformed, and this high performance shows the applicability of the KNN classifier in the proposed problem.

Highlights

  • With the passage of time, the information and user knowledge become increasing. is is due to the advancements and rapid development in technology

  • Different algorithms of rough set were applied for the experimental setup of the proposed research. ese algorithms include K-nearest neighbours (KNN), decision rule, decomposition tree, and local transfer function classifier (LTF-C)

  • Different performance metrics such as specificity, accuracy, precision, F-score, recall, and misclassification rates are followed to check the validity of the proposed model based on different classification algorithms. ese algorithms include KNN, cross validation/k-fold mechanism, decision rules, decomposition trees, and local transfer function classifier. e accumulated results and discussion are discussed below in detail

Read more

Summary

Introduction

With the passage of time, the information and user knowledge become increasing. is is due to the advancements and rapid development in technology. With the passage of time, the information and user knowledge become increasing. Is is due to the advancements and rapid development in technology. Essential information has become the need of users in their daily life which requires the support of advanced tools like Hadoop, Tableau, Informatica PowerCenter, and so on. E data and knowledge exist in diverse shapes such as structured and unstructured. E structured data are mostly understandable and can be managed, while extracting meaningful insights from unstructured data has become a challenging task. Electronic data of approximately 1.2 ZB (1021) are generated per year by diverse sources [2]. Human beings are always interested to capture the knowledge in an easy and effective way. Is easiness is due to the translation of data and knowledge through graphs or maps for user understanding Human beings are always interested to capture the knowledge in an easy and effective way. is easiness is due to the translation of data and knowledge through graphs or maps for user understanding

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.