Testing the Relationship between Word Length, Frequency, and Predictability Based on the German Reference Corpus.

Alexander Koplenig,Marc Kupietz,Sascha Wolfer

doi:10.1111/cogs.13090

Alexander Koplenig, Marc Kupietz + Show 1 more

Open Access

https://doi.org/10.1111/cogs.13090

Copy DOI

Journal: Cognitive Science	Publication Date: Jun 1, 2022
Citations: 6	License type: other-oa

Affiliation: Leibniz Institute for the German Language

Abstract

In a recent article, Meylan and Griffiths (Meylan & Griffiths, 2021, henceforth, M&G) focus their attention on the significant methodological challenges that can arise when using large-scale linguistic corpora. To this end, M&G revisit a well-known result of Piantadosi, Tily, and Gibson (2011, henceforth, PT&G) who argue that average information content is a better predictor of word length than word frequency. We applaud M&G who conducted a very important study that should be read by any researcher interested in working with large-scale corpora. The fact that M&G mostly failed to find clear evidence in favor of PT&G's main finding motivated us to test PT&G's idea on a subset of the largest archive of German language texts designed for linguistic research, the German Reference Corpus consisting of ∼43 billion words. We only find very little support for the primary data point reported by PT&G.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Testing the Relationship between Word Length, Frequency, and Predictability Based on the German Reference Corpus.

Abstract

Talk to us

Similar Papers

More From: Cognitive Science

Lead the way for us

Similar Papers

The Challenges of Large-Scale, Web-Based Language Datasets: Word Length and Predictability Revisited.
Stephan C Meylan ... Thomas L Griffiths
Cognitive Science | VOL. 45
Stephan C Meylan, et. al.Stephan C Meylan ... Thomas L Griffiths
01 Jun 2021
Cognitive Science | VOL. 45

Author response: An oscillating computational model can track pseudo-rhythmic speech by using linguistic predictions
Sanne ten Oever ... Andrea E Martin
-
Sanne ten Oever, et. al.Sanne ten Oever ... Andrea E Martin
21 Jun 2021
21 Jun 2021

The Locus of Word Length and Frequency Effect in Comprehending English Words by Korean-English Bilinguals and Americans
Kichun Nam ... Yoonhyong Lee
-
Kichun Nam, et. al.Kichun Nam ... Yoonhyong Lee
01 Jan 2004
01 Jan 2004

Aphasia and spelling to dictation: Analysis of spelling errors and editing.
Charlotte Johansson‐Malmeling ... Åsa Wengelin
International journal of language & communication disorders | VOL. 56
Charlotte Johansson‐Malmeling, et. al.Charlotte Johansson‐Malmeling ... Åsa Wengelin
27 Dec 2020
International journal of language & communication disorders | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Testing the Relationship between Word Length, Frequency, and Predictability Based on the German Reference Corpus.

Abstract

Talk to us

Similar Papers

More From: Cognitive Science