Abstract

Retrieval is one of the stages in case-based reasoning system which find a solution to new problem or case by measuring the similarity between the new case and old cases in the case base. Some of the similarity measurement techniques are involving feature weights that show the importance of the feature in a case. Feature weights can be obtained from a domain expert or by using a feature weighting method either locally or globally. Gradient descent is the feature weighting method which computes global weights for each feature. This research implemented gradient descent to obtain feature weights in case-based reasoning for hepatitis diagnosis and the similarity measurement using weighted Euclidean distance. There are four variations number of case base and test data that used in this research, those are: the first variation using 50% of data as case base and 50% as test data second variation using 60% of data as case base and 40% as test data, third variation using 70% of data as case base and 30% as test data and fourth variation using 80% of data as case base and 20% as test data. For each variation, using 4 kinds of scenario to mark the test data those are in first scenario the test data mark at the end of data, in second scenario the test data mark at the begin of data, in third scenario the test data mark half at the begin and half at the end of data and in the fourth scenario the test data mark in the middle of data. The result of this research showed that the accuracy of the system reaches 100% at scenario 1 in variation 4. Overall of all four variations and four kinds of scenario, the average accuracy of the system was 77.55%, average recall of system was 69.74%, and the average of precision was 78.39%. In addition, the level of accuracy was also influenced by the number of case base and the scenario of case selection for the case base. This is because more cases in the case base, the chances of a system to finding similar cases will be more.

Highlights

  • Case-Based Reasoning (CBR) is a problem-solving method using old experiences with the specific way

  • The weight of case features is very important in the similarity calculation that involves feature weight

  • Where N is the number of the case in case base and is similarity value of a pair cases and, with involving trained weights (w) that compute using equation (2)

Read more

Summary

Introduction

Case-Based Reasoning (CBR) is a problem-solving method using old experiences with the specific way. Case base is old experiences of problems that have solutions. Every case in case base consists of problem and solution [1][2] [3]. The retrieval process or process of finding old cases that have similarity to a new case is one of the most important processes in CBR system [4][5]. Some of the similarity measurement techniques are involving feature weights. The feature weight can provide information about the importance level of the feature in a case. The weight of case features is very important in the similarity calculation that involves feature weight.

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call