Compositionality and Sentence Meaning: Comparing Semantic Parsing and Transformers on a Challenging Sentence Similarity Dataset

James Fodor,Shinsuke Suzuki,Simon De Deyne

doi:10.1162/coli_a_00536

Abstract

Abstract One of the major outstanding questions in computational semantics is how humans integrate the meaning of individual words into a sentence in a way that enables understanding of complex and novel combinations of words, a phenomenon known as compositionality. Many approaches to modelling the process of compositionality can be classified as either ‘vector-based’ models, in which the meaning of a sentence is represented as a vector of numbers, or ‘syntax-based’ models, in which the meaning of a sentence is represented as a structured tree of labelled components. A major barrier in assessing and comparing these contrasting approaches is the lack of large, relevant datasets for model comparison. This paper aims to address this gap by introducing a new dataset, STS3k, which consists of 2,800 pairs of sentences rated for semantic similarity by human participants. The sentence pairs have been selected to systematically vary different combinations of words, providing a rigorous test and enabling a clearer picture of the comparative strengths and weaknesses of vector-based and syntax-based methods. Our results show that when tested on the new STS3k dataset, state-of-the-art transformers poorly capture the pattern of human semantic similarity judgments, while even simple methods for combining syntax- and vector-based components into a novel hybrid model yield substantial improvements. We further show that this improvement is due to the ability of the hybrid model to replicate human sensitivity to specific changes in sentence structure. Our findings provide evidence for the value of integrating multiple methods to better reflect the way in which humans mentally represent compositional meaning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Compositionality and Sentence Meaning: Comparing Semantic Parsing and Transformers on a Challenging Sentence Similarity Dataset

Abstract

Talk to us

Similar Papers

More From: Computational Linguistics

Lead the way for us

Journal: Computational Linguistics	Publication Date: Sep 18, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

Sentence similarity measuring by vector space model
U. L. D. N. Gunasinghe ... A. S. Perera
-
U. L. D. N. Gunasinghe, et. al.U. L. D. N. Gunasinghe ... A. S. Perera
01 Dec 2014
01 Dec 2014

Semantics and Pragmatics
Guy Longworth
-
Guy LongworthGuy Longworth
18 Feb 2017
18 Feb 2017

Predicting Semantic Similarity Between Clinical Sentence Pairs Using Transformer Models: Evaluation and Representational Analysis.
Mark Ormerod ... Jesús Martínez Del Rincón
JMIR Medical Informatics | VOL. 9
Mark Ormerod, et. al.Mark Ormerod ... Jesús Martínez Del Rincón
26 May 2021
JMIR Medical Informatics | VOL. 9

SISR: System for integrating semantic relatedness and similarity measures
Mohamed Ben Aouicha ... Mohamed Ali Hadj Taieb
Soft Computing | VOL. 22
Mohamed Ben Aouicha, et. al.Mohamed Ben Aouicha ... Mohamed Ali Hadj Taieb
21 Nov 2016
Soft Computing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Compositionality and Sentence Meaning: Comparing Semantic Parsing and Transformers on a Challenging Sentence Similarity Dataset

Abstract

Talk to us

Similar Papers

More From: Computational Linguistics