Classification of hadith into positive suggestion, negative suggestion, and information

Said Al Faraby,Adiwijaya Adiwijaya,Eliza Riviera Rachmawati Jasin,Andina Kusumaningrum

doi:10.1088/1742-6596/971/1/012046

Said Al Faraby, Adiwijaya Adiwijaya + Show 2 more

Open Access

https://doi.org/10.1088/1742-6596/971/1/012046

Copy DOI

Journal: Journal of Physics: Conference Series	Publication Date: Mar 1, 2018
Citations: 12	License type: cc-by

Affiliation: Telkom University

Abstract

As one of the Muslim life guidelines, based on the meaning of its sentence(s), a hadith can be viewed as a suggestion for doing something, or a suggestion for not doing something, or just information without any suggestion. In this paper, we tried to classify the Bahasa translation of hadith into the three categories using machine learning approach. We tried stemming and stopword removal in preprocessing, and TF-IDF of unigram, bigram, and trigram as the extracted features. As the classifier, we compared between SVM and Neural Network. Since the categories are new, so in order to compare the results of the previous pipelines, we created a baseline classifier using simple rule-based string matching technique. The rule-based algorithm conditions on the occurrence of words such as “janganlah, sholatlah, and so on” to determine the category. The baseline method achieved F1-Score of 0.69, while the best F1-Score from the machine learning approach was 0.88, and it was produced by SVM model with the linear kernel.

Full Text