A generating model for Finnish nominal inflection using distributional semantics

Alexandre Nikolaev,R Harald Baayen,Yu-Ying Chuang

doi:10.1075/ml.22008.nik

Alexandre Nikolaev, R Harald Baayen + Show 1 more

Open Access

https://doi.org/10.1075/ml.22008.nik

Copy DOI

Abstract

AbstractFinnish nouns are characterized by rich inflectional variation, with obligatory marking of case and number, with optional possessive suffixes and with the possibility of further cliticization. We present a model for the conceptualization of Finnish inflected nouns, using pre-compiled fasttext embeddings (300-dimensional semantic vectors that approximate words’ meanings). Instead of deriving the semantic vector of an inflected word from another word in its paradigm, we propose that an inflected word is conceptualized by means of summation of latent vectors representing the meanings of its lexeme and its inflectional features. We tested this model on the 2,000 most frequent Finnish nouns and their inflected word forms from a corpus of Finnish (84 million tokens). Visualization of the semantic space of Finnish using t-SNE clarified that a ‘main effects’ additive model does not do justice to the semantics of inflection. In Finnish, how number is realized turns out to vary substantially with case. Further interactions emerged with the possessive suffixes and the clitics. By taking these interactions into account, the accuracy of our model, evaluated with the fasttext embeddings as gold standard, improved from 76% to 89%. Analyses of the errors made by the model clarified that 7.5% of errors are due to overabundance (and hence not true errors), and that 16.5% of the errors involved exchanges of semantically highly similar stems (lexemes). Our results indicate, first, that the semantics of Finnish noun inflection are more intricate than assumed thus far, and second, that these intricacies can be captured with surprisingly high accuracy by a simple generating model based on imputed semantic vectors for lexemes, inflectional features, and interactions of inflectional features.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Mental Lexicon	Publication Date: Dec 31, 2022
Citations: 6	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A generating model for Finnish nominal inflection using distributional semantics

Abstract

Talk to us

Similar Papers

More From: The Mental Lexicon

Lead the way for us

Similar Papers

Isotonic Additive Interaction Models
Ilya Gluhovsky
Recent Advances and Trends in Nonparametric Statistics | VOL. -
Ilya GluhovskyIlya Gluhovsky
01 Jan 2003
Recent Advances and Trends in Nonparametric Statistics | VOL. -

Combining an Additive and Tree-Based Regression Model Simultaneously: STIMA
Elise Dusseldorp ... Bart Jan Van Os
Journal of Computational and Graphical Statistics | VOL. 19
Elise Dusseldorp, et. al.Elise Dusseldorp ... Bart Jan Van Os
01 Jan 2009
Journal of Computational and Graphical Statistics | VOL. 19

The Semantic Vectors Package: New Algorithms and Public Tools for Distributional Semantics
Dominic Widdows ... Trevor Cohen
-
Dominic Widdows, et. al.Dominic Widdows ... Trevor Cohen
01 Sep 2010
01 Sep 2010

Functional additive models for optimizing individualized treatment rules.
Hyung Park ... R Todd Ogden
Biometrics | VOL. 79
Hyung Park, et. al.Hyung Park ... R Todd Ogden
22 Nov 2021
Biometrics | VOL. 79

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A generating model for Finnish nominal inflection using distributional semantics

Abstract

Talk to us

Similar Papers

More From: The Mental Lexicon