Abstract

Laufer and Nation (1995) proposed that the Lexical Frequency Profile (LFP) can estimate the size of a second‐language writer's productive vocabulary. Meara (2005) questioned the sensitivity and the reliability of LFPs for estimating vocabulary sizes, based on the results obtained from probabilistic simulations of LFPs. However, the underlying mathematical model for the simulations, based on Zipf's law, allows such an analysis to be done directly, without recourse to simulations. The direct analysis has the further advantage of demonstrating how variability estimates obtained from within the 1k band (the 1,000 most frequent words of English) portion of written texts may explain the simulation results. The findings confirm that the ability of LFPs to distinguish between groups diminishes as vocabulary size increases. However, for fairly homogeneous groups, LFPs are able to provide a coarse but reasonable tool for vocabulary size estimation. We also explore modifications to Zipf's law that may result in a more accurate model of word frequencies in natural language.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.