Abstract

This paper presents a survey of the diachroniccorporaavailable online for the study of the Romance languages. In the first place the makeup of eachcorpusis described, indicating the number of texts and tokens included and the manner of classification of the documents following chronological, typological and diatopic criteria. After having examined the problems involved in lemmatization and morphosyntactic annotation, the paper will look at query options with a view to possible research into lexicon, morphology, syntax and semantics. A short conclusion will consist in the presentation of the MIDIAcorpus, published in June 2014, which represents the first tool devised for the study of Italian from a lengthy diachronic perspective (from the earliest texts to the mid-twentieth century).

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call