Abstract
Name disambiguation is the problem solving process to find similar names in sentences. The ambiguity of names can be found in hadith of Sahih Bukhari, names "Abdullah bin Amru" in hadiths no 27 and “Abdullah bin Amru” in hadith no 58, These names are the same, but there is no proof they are the same person. This problem is the early indication of ambiguity of name in the hadith. Based in this problem, this research aims to find name disambiguation of hadith narrators with classification by considering the perawi chain. To solved this problem the authors used Word Sense Disambiguation (WSD), WSD is a process to assign the same meaning from the sentences, based on the context in which the word appears. To classify several names in the hadith, the authors used KNN algorithm, by combining the WSD and KNN method can reduce the ambiguity of names in hadith. The data used in this study came from the hadith of Sahih Bukhori through the pre-processing stage. After conducting the research showed a collection of hadith numbers with the same name prediction with an accuracy of 99% at k = 1. Thus, this method can be used for name disambiguation.
Highlights
Each word certainly has its meaning, but what happens if there are words that have more than one meaning
This paper used machine learning process with the supervised learning method for word sense disambiguation, based on the k-nearest neighbor algorithm with an accuracy of 76.1%, have proven the effectiveness of using a method that considers the classification of named entities with entries from the most appropriate knowledge base (Rezapour et al, 2011)
This study aims to build a dataset containing a set of named entities (Shen et al, 2018), determine the results of disambiguation testing using Word Sense Disambiguation and Entity Linking and measure system performance based on precision, recall and f1-score
Summary
Each word certainly has its meaning, but what happens if there are words that have more than one meaning. This condition called ambiguous (Agrawal et al, 2019). Based on the Big Indonesian Dictionary, ambiguous means that it has more than one meaning. This ambiguity can raise doubts in the written or spoken sentence. A disambiguation process is needed so that the ambiguous word becomes a clear word. Disambiguation is the process of removing an ambiguous word by making it clear words (Zhang & Hasan, 2017). Disambiguation process is to distinguish between the meaning of the same word to be different (Moro et al, 2014)
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have