Sentence Similarity Based on Contexts

Xiaofei Sun,Chun Fan,Tianwei Zhang,Fei Wu,Yuxian Meng,Jiwei Li,Xiang Ao

doi:10.1162/tacl_a_00477

Abstract

AbstractExisting methods to measure sentence similarity are faced with two challenges: (1) labeled datasets are usually limited in size, making them insufficient to train supervised neural models; and (2) there is a training-test gap for unsupervised language modeling (LM) based models to compute semantic scores between sentences, since sentence-level semantics are not explicitly modeled at training. This results in inferior performances in this task. In this work, we propose a new framework to address these two issues. The proposed framework is based on the core idea that the meaning of a sentence should be defined by its contexts, and that sentence similarity can be measured by comparing the probabilities of generating two sentences given the same context. The proposed framework is able to generate high-quality, large-scale dataset with semantic similarity scores between two sentences in an unsupervised manner, with which the train-test gap can be largely bridged. Extensive experiments show that the proposed framework achieves significant performance boosts over existing baselines under both the supervised and unsupervised settings across different datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: May 16, 2022
Citations: 17	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Sentence Similarity Based on Contexts

Abstract

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Contextual Sentence Similarity from News Articles
Nikhil Chaturvedi ... Jigyasu Dubey
International Journal of Scientific Research in Computer Science, Engineering and Information Technology | VOL. 10
Nikhil Chaturvedi, et. al.Nikhil Chaturvedi ... Jigyasu Dubey
14 Mar 2024
International Journal of Scientific Research in Computer Science, Engineering and Information Technology | VOL. 10

Sentence Similarity Based on Contexts
...
-
, et. al. ...
11 May 2022
11 May 2022

Priority based Semantic Web Crawler
Devshri Roy ... Jaytrilok Choudhary
International Journal of Computer Applications | VOL. 81
Devshri Roy, et. al.Devshri Roy ... Jaytrilok Choudhary
22 Nov 2013
International Journal of Computer Applications | VOL. 81

Kernel-Based Semantic Hashing for Gait Retrieval
Yucan Zhou ... Yongzhen Huang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 28
Yucan Zhou, et. al.Yucan Zhou ... Yongzhen Huang
01 Oct 2018
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sentence Similarity Based on Contexts

Abstract

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics