Abstract

Paraphrase identification techniques aim to determine whether two sentences or texts have the same meaning, even when they share few identical words or phrases. In this paper we propose a hybrid technique that uses an attention constituency vector (ACV)-tree kernel to compute the similarity of short texts or sentences. A similarity threshold is then used to decide whether the sentences are paraphrases of each other. The experiments are conducted on an Arabic paraphrasing benchmark. The proposed method achieves a recall of 70% and a precision of 76% with a threshold of 0.5, and a recall of 94% and a precision of 75.1% with a threshold of 0.3.
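
The thresholding step can be illustrated with a minimal sketch. It assumes the similarity scores have already been produced by some kernel (the ACV-tree kernel itself is not implemented here) and that a pair is labeled a paraphrase when its score is greater than or equal to the threshold; the abstract does not say whether the comparison is inclusive. The function names and the toy scores and labels are hypothetical and are not taken from the benchmark.

```python
from typing import List, Tuple


def classify(scores: List[float], threshold: float) -> List[bool]:
    """Label a pair as a paraphrase when its similarity reaches the threshold.

    Assumption: the comparison is inclusive (score >= threshold).
    """
    return [s >= threshold for s in scores]


def precision_recall(predicted: List[bool], gold: List[bool]) -> Tuple[float, float]:
    """Compute precision and recall for the positive (paraphrase) class."""
    tp = sum(1 for p, g in zip(predicted, gold) if p and g)
    fp = sum(1 for p, g in zip(predicted, gold) if p and not g)
    fn = sum(1 for p, g in zip(predicted, gold) if not p and g)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical similarity scores and gold labels for six sentence pairs.
scores = [0.9, 0.6, 0.45, 0.35, 0.2, 0.1]
gold = [True, True, False, True, False, False]

for threshold in (0.5, 0.3):
    p, r = precision_recall(classify(scores, threshold), gold)
    print(f"threshold={threshold}: precision={p:.2f}, recall={r:.2f}")
```

On this toy data, lowering the threshold from 0.5 to 0.3 raises recall and lowers precision, which is the same kind of trade-off the reported figures show.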
