Abstract
Chapter Al-Baqarah is the longest chapter in the Holy Quran, and it covers various topics. Al-Quran is the primary text of Islamic faith and practice. Millions of Muslims worldwide use Al - Quran as their reference book, and it, therefore, helps Muslims and Islamic scholars as guidance of the law life. Text clustering (unsupervised learning) is a process of separation that has to be divided text into the same section of similar documents. There are many text clustering algorithms and techniques used to make clusters, such as partitioning and density-based methods. In this paper, k-means preferred as a partitioning method and DBSCAN, OPTICS as a density-based method. This study aims to investigate and find which algorithm produced as the best accurate performance cluster for Al-Baqarah’s English Tafseer chapter. Data preprocessing and feature extraction using Term Frequency-Inverse Document Frequency (TF-IDF) have applied for the dataset. The result shows k-means outperformed even has the smallest of Silhouette Coefficient (SC) score compared to others due to less implementation time with no noise production for seven clusters of Al-Baqarah chapter. OPTICS has no noise with the medium of SC score but has the longest implementation time due to its complexity.
Highlights
The Quran is a significant religious text written in Quranic Arabic, followed by believers of the Islamic faith
Chapter Al-Baqarah has 53 subjects, some still have the same subject as others
The verses with the same subject are grouped into seven major themes or topics
Summary
The Quran is a significant religious text written in Quranic Arabic, followed by believers of the Islamic faith. The Quran means "perfect reading," in terms of language, which Muslims believed to be revealed to people as a guide in all aspects of life. Al-Quran wrote in Arabic but has translated into numerous languages around the world, as well as English. Chapter Al-Baqarah is Al-Quran's longest chapter, so there are various themes in Al-Baqarah's chapter. Such themes are not written sequentially but depend on asbabunnuzul ayat (verses) while revealed. Grouping verses of similar characteristics of a text they compose will form a cluster that could reflect any theme in Surah (chapter) Al–Baqarah [1]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Advanced Computer Science and Applications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.