Evaluation of Lexical-Based Approaches to the Semantic Similarity of Malay Sentences

Shahrul Azman Noah,Nazlia Omar,Amru Yusrin Amruddin

doi:10.1080/09296174.2014.1001637

Shahrul Azman Noah, Nazlia Omar + Show 1 more

https://doi.org/10.1080/09296174.2014.1001637

Copy DOI

Abstract

We evaluate existing and modified approaches for measuring the semantic similarity of sentences in the Malay language. These approaches are mainly used for English sentences and no studies to date have evaluated and compared their effectiveness when applied to Malay sentences. We used a pre-processed Malay machine-readable dictionary to calculate word-to-word semantic similarity with two methods: probability of intersection and normalization. We then used the word-to-word semantic similarity measure to identify semantic sentence similarity. We evaluated five measures of semantic sentence similarity: vector-based semantic similarity, word order similarity, highest word-to-sentence similarity, and combinations of vector-based and word-to-sentence similarity and of word order and word-to-sentence similarity. We also evaluated the effects of including and excluding lexical components such as prepositions, conjunctions, verbs, and morphological variants.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluation of Lexical-Based Approaches to the Semantic Similarity of Malay Sentences

Abstract

Talk to us

Similar Papers

More From: Journal of Quantitative Linguistics

Lead the way for us

Journal: Journal of Quantitative Linguistics	Publication Date: Mar 19, 2015
Citations: 6

Similar Papers

SyMSS: A syntax-based measure for short-text semantic similarity
Jesús Oliva ... Ángel Iglesias
Data & Knowledge Engineering | VOL. 70
Jesús Oliva, et. al.Jesús Oliva ... Ángel Iglesias
22 Jan 2011
Data & Knowledge Engineering | VOL. 70

Semantic Similarity Measures for Malay Sentences
Shahrul Azman Noah ... Nazlia Omar
-
Shahrul Azman Noah, et. al.Shahrul Azman Noah ... Nazlia Omar
10 Dec 2007
10 Dec 2007

A BERT-GRU Model for Measuring the Similarity of Arabic Text
Rakia Saidi ... Didier Schwab
JUCS - Journal of Universal Computer Science | VOL. 30
Rakia Saidi, et. al.Rakia Saidi ... Didier Schwab
28 Jun 2024
JUCS - Journal of Universal Computer Science | VOL. 30

BIOSSES: a semantic sentence similarity estimation system for the biomedical domain.
Gizem Soğancıoğlu ... Hakime Öztürk
Bioinformatics | VOL. 33
Gizem Soğancıoğlu, et. al.Gizem Soğancıoğlu ... Hakime Öztürk
12 Jul 2017
Bioinformatics | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of Lexical-Based Approaches to the Semantic Similarity of Malay Sentences

Abstract

Talk to us

Similar Papers

More From: Journal of Quantitative Linguistics