Abstract

Sentence similarity methods are used to assess the degree of likelihood between phrases. Many natural language applications such as text summarization, information retrieval, text categorization, and machine translation employ measures of sentence similarity. The existing approaches for this problem represent sentences as vectors of bag of words or the syntactic information of the words in the phrase. The likelihood between phrases is calculated by composing the similarity between the words in the sentences. Such schemes do not address two important concerns in the area, however: the semantic problem and the word order. This paper proposes a new sentence similarity measure that attempts to address such problems by taking into account the lexical, syntactic, and semantic analysis of sentences. The new similarity measure proposed outperforms the state of the art systems in around 6%, when tested using a standard and publically available dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.