A Large-Scale Analysis of Variance in Written Language.

Brendan T Johns,Randall K Jamieson

doi:10.1111/cogs.12583

Abstract

The collection of very large text sources has revolutionized the study of natural language, leading to the development of several models of language learning and distributional semantics that extract sophisticated semantic representations of words based on the statistical redundancies contained within natural language (e.g., Griffiths, Steyvers, & Tenenbaum, ; Jones & Mewhort, ; Landauer & Dumais, ; Mikolov, Sutskever, Chen, Corrado, & Dean, ). The models treat knowledge as an interaction of processing mechanisms and the structure of language experience. But language experience is often treated agnostically. We report a distributional semantic analysis that shows written language in fiction books varies appreciably between books from the different genres, books from the same genre, and even books written by the same author. Given that current theories assume that word knowledge reflects an interaction between processing mechanisms and the language environment, the analysis shows the need for the field to engage in a more deliberate consideration and curation of the corpora used in computational studies of natural language processing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Large-Scale Analysis of Variance in Written Language.

Abstract

Talk to us

Similar Papers

More From: Cognitive Science

Lead the way for us

Journal: Cognitive Science	Publication Date: Jan 22, 2018
Citations: 23

Similar Papers

Distributional Semantic Models of Attribute Meaning in Adjectives and Nouns

-

01 Jan 2015
01 Jan 2015

Extract Similarities from Syntactic Contexts: a Distributional Semantic Model Based on Syntactic Distance
Alessandro Maisto
Italian Journal of Computational Linguistics | VOL. 8
Alessandro MaistoAlessandro Maisto
01 Dec 2022
Italian Journal of Computational Linguistics | VOL. 8

Semantic models for answer re-ranking in question answering
Piero Molino
-
Piero MolinoPiero Molino
28 Jul 2013
28 Jul 2013

Challenging distributional models with a conceptual network of philosophical terms
...
-
, et. al. ...
25 May 2021
25 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Large-Scale Analysis of Variance in Written Language.

Abstract

Talk to us

Similar Papers

More From: Cognitive Science