Abstract

Finding semantic text similarity (STS) between two pieces of text is a well-known problem in Natural Language Processing. Its applications are nearly in every field such as plagiarism detection, finding related user queries in customer services or finding similar questions in search engines or forums like Stack Overflow, Quora and Stack exchange. If applied to any religious text, it can help to relate how similar pieces of knowledge are described in different places. This paper uses Word2Vec and Sent2Vec models to facilitate the process of knowledge extraction from a given corpus. The paper makes use of several English translations of the Holy Quran which is the most sacred book for Muslims. Sent2vec models have been trained from several translations of the book and the trained models are then subsequently utilized to study the semantic relationship between different words and sentences. The performance of the custom-built word embeddings is compared against the pre-trained embeddings provided by the Spacy library.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.