PhrasIS: Phrase Inference and Similarity benchmark

I Lopez-Gazpio,M Maritxalar,J Gaviria,A Zarranz,P García,E Agirre,B Sanz,H Sanjurjo-González

doi:10.1093/jigpal/jzae037

PhrasIS: Phrase Inference and Similarity benchmark

I Lopez-Gazpio, M Maritxalar + Show 6 more

https://doi.org/10.1093/jigpal/jzae037

Copy DOI

Journal: Logic Journal of the IGPL

Publication Date: Apr 5, 2024

#Phrase Pairs #Image Captions + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

Abstract We present PhrasIS, a benchmark dataset composed of natural occurring Phrase pairs with Inference and Similarity annotations for the evaluation of semantic representations. The described dataset fills the gap between word and sentence-level datasets, allowing to evaluate compositional models at a finer granularity than sentences. Contrary to other datasets, the phrase pairs are extracted from naturally occurring text in image captions and news headlines. All the text fragments have been annotated by experts following a rigorous process also described in the manuscript achieving high inter annotator agreement. In this work we analyse the dataset, showing the relation between inference labels and similarity scores. With 10K phrase pairs split in development and test, the dataset is an excellent benchmark for testing meaning representation systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Logic Journal of the IGPL

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.