Abstract

Measuring the semantic similarity and relatedness of words can play a vital role in many natural language processing tasks. Distributional semantic models computing these measures can have many different parameters, such as different weighting schemes, vector similarity measures, feature transformation functions and dimensionality reduction techniques. Despite their importance, no truly comprehensive study has simultaneously evaluated the numerous parameters of such models while also considering how these parameters interact with each other. We address this gap with a systematic study. Taking the distributional information extracted from the chosen dataset as given, we evaluate all important aspects of the creation and comparison of feature vectors in distributional semantic models. Testing 10 parameters simultaneously, we seek the best combination of parameter settings, examining a large number of settings in the case of some parameters. Besides evaluating the conventionally used settings for the parameters, we also propose numerous novel variants, as well as novel combinations of parameter settings, some of which significantly outperform the combinations of settings in general use, thus achieving state-of-the-art results.
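To make the parameter types concrete, the minimal sketch below pairs one widely used weighting scheme (positive pointwise mutual information) with one widely used vector similarity measure (cosine) on a toy co-occurrence matrix. The words, counts and the particular parameter choices are illustrative assumptions for this sketch, not the specific settings or data evaluated in the study.

```python
import numpy as np

# Toy co-occurrence counts: rows = target words, columns = context features.
# Words and counts are invented purely for illustration.
counts = np.array([
    [10.0, 0.0, 3.0],   # "cat"
    [ 8.0, 1.0, 2.0],   # "dog"
    [ 0.0, 9.0, 1.0],   # "car"
])

def ppmi(m):
    """Positive pointwise mutual information weighting of a count matrix."""
    total = m.sum()
    p_ij = m / total                                # joint probabilities
    p_i = m.sum(axis=1, keepdims=True) / total      # row marginals
    p_j = m.sum(axis=0, keepdims=True) / total      # column marginals
    with np.errstate(divide="ignore", invalid="ignore"):
        pmi = np.log2(p_ij / (p_i * p_j))
    pmi[~np.isfinite(pmi)] = 0.0                    # zero counts -> 0, not -inf
    return np.maximum(pmi, 0.0)                     # keep only positive PMI

def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

weighted = ppmi(counts)
print(cosine(weighted[0], weighted[1]))  # "cat" vs "dog": relatively high
print(cosine(weighted[0], weighted[2]))  # "cat" vs "car": low
```

Each of these two choices is one point in a much larger parameter space: PPMI could be swapped for other weighting schemes, and cosine for other similarity measures, which is exactly the kind of combination the study evaluates exhaustively.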
