Abstract

This paper examines the dynamic relationship between vocabulary size, text length and text coverage in English. Text coverage is the ratio of the number of words in a text (or collection of texts) that are covered by a given vocabulary to the total number of words in that text (or collection). The results reveal that, on average, for texts between 50 and 1,000,000 words in length, the coverage provided by a given vocabulary is not significantly affected by text length. In addition, the relationship between text coverage and vocabulary size can be captured by the re-parametrized mathematical models of Altmann, Tuldava, and Köhler and Martináková-Rendeková.
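
To make the definition of text coverage concrete, the following is a minimal sketch in Python. It is an illustration only, not the procedure used in the paper: the function names `text_coverage` and `top_n_vocabulary`, the whitespace tokenization, and the choice of the most frequent words as the vocabulary are all assumptions made for this example.

```python
from collections import Counter


def text_coverage(tokens, vocabulary):
    """Return the share of running words in `tokens` found in `vocabulary`.

    Hypothetical helper: coverage = covered tokens / total tokens.
    """
    if not tokens:
        raise ValueError("text must contain at least one token")
    vocab = set(vocabulary)
    covered = sum(1 for token in tokens if token in vocab)
    return covered / len(tokens)


def top_n_vocabulary(tokens, n):
    """Return the n most frequent word types (an assumed way to pick a vocabulary)."""
    return [word for word, _ in Counter(tokens).most_common(n)]


# Toy text, split on whitespace (an assumption; real studies need proper tokenization).
tokens = "the cat sat on the mat and the dog sat too".split()
vocab = top_n_vocabulary(tokens, 2)      # the two most frequent word types
coverage = text_coverage(tokens, vocab)  # 5 of the 11 tokens are covered
print(f"vocabulary={vocab}, coverage={coverage:.3f}")
```

In this toy text the two most frequent word types account for 5 of the 11 running words, so the coverage is 5/11. Repeating the calculation for growing values of `n` would trace out a coverage curve against vocabulary size, which is the kind of relationship the models named above are meant to describe.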
