Abstract

In spite of important precedent, after an early phase computational and corpus research has been focusing on modern languages, especially English. Krauwer (1998) introduces idea of a Basic Language Resource Kit (BLARK), which he defines as the minimal set of language resources that is necessary to do any precompetitive research and education at all. The first step towards developing computational resources for Latin consists of collecting textual material in a format that can be handled by Natural Language Processing (NLP) tools. Those texts were not originally produced digitally, which means that a conversion from paper to electronic format is necessary. Morphological features of words can be included either through a manual annotation or through automatic methods. The chapter presents a brief overview on existing computational resources and tools for Latin covered annotated corpora, various NLP tools, and lexical databases.Keywords: annotating morphology; Basic Language Resource Kit (BLARK); computational resources; Krauwer; Latin corpora; Natural Language Processing (NLP) tools; semantic resources

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call