Abstract

The paper presents work on two databases of vernacular Old East Slavic writing: the database of birch bark letters and the database of epigraphy. The task is to link archaeological and historical information with linguistic information, so that the database and a linguistic corpus supporting grammatical and lexical search can be expanded and updated simultaneously. The online database Old East Slavic birch bark letters and the section of the historical subcorpus of the Russian National Corpus (RNC) that contains birch bark letters need to be integrated. This includes, in particular: creating an online workstation for the morphological markup of texts linked to the database entries; exporting XML databases with morphological markup for inclusion in the RNC; automatically generating forward and backward word indices for the database from the marked-up corpus; and cross-checking the markup of the corpus against the manually compiled indices to A. A. Zalizniak's book The Old Novgorod Dialect (a new, updated edition of which is being prepared). The new database of Old East Slavic epigraphy is designed to overcome the fragmented state of publications and research in the field by combining the material accumulated by scholars into a single electronic resource. In terms of software architecture, the epigraphy database is analogous to the birch bark database, which makes it possible to create an annotated corpus of Old East Slavic epigraphy.
