Expanding horizons in historical linguistics with the 400-million word Corpus of Historical American English

Mark Davies

doi:10.3366/cor.2012.0024

Expanding horizons in historical linguistics with the 400-million word Corpus of Historical American English

Mark Davies

https://doi.org/10.3366/cor.2012.0024

Copy DOI

Export

Save

Cite

Journal: Corpora	Publication Date: Nov 1, 2012
Citations: 182

#Corpus Of Historical American English #Corpus Of English #Text Archive #Non-fiction Books #Google Books #Popular Magazines #Historical Linguistics #American English #Changes In Syntax #Language Change

Abstract
Full-Text
Similar Papers

Abstract

Listen

The Corpus of Historical American English (COHA) contains 400 million words in more than 100,000 texts which date from the 1810s to the 2000s. The corpus contains texts from fiction, popular magazines, newspapers and non-fiction books, and is balanced by genre from decade to decade. It has been carefully lemmatised and tagged for part-of-speech, and uses the same architecture as the Corpus of Contemporary American English (COCA), BYU-BNC, the TIME Corpus and other corpora. COHA allows for a wide range of research on changes in lexis, morphology, syntax, semantics, and American culture and society (as viewed through language change), in ways that are probably not possible with any text archive (e.g., Google Books) or any other corpus of historical American English.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Corpora

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Expanding horizons in historical linguistics with the 400-million word Corpus of Historical American English