Abstract

Lemmatization and Morphological lagging : their Application to Authorship Attribution. - Traditional methods of attributing an anonymous text to his own author have been increased by the outcome of linguistic statistics for a few years now. By far statistics provides a more objective way of comparing texts to one another. Textual corpora however have not often be tagged ; as researchers have not been given the opportunity to point out and systematically retrieve grammatical occurrences and features of a given corpus, there has been no other choice left than to study lexical connection between texts. The method has proved successful, results yet depend perceptibly on topics and literary genres. We will therefore proceed to analyse a classical Latin corpus in which texts have been lemmatizated and grammatically tagged. We will endeavour to examine if dissimilarity measures between texts from the study of grammatical parameters give finer and discriminating results than by lexical means. If our conclusion occurs to be positive, from now on, it is worth considering the undertaking of lemmatization of medieval Latin texts.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call