Abstract

Editors of Slavonic and Slavonic–Romanian text can make use of a large variety of tools (fonts, physical and virtual keyboard layouts, word processors, operating systems) for transcribing and digitizing these texts in a uniform manner. The uniformity of the transcripts is based on Unicode standardization. Our study aims at explaining the place of Slavonic in Unicode and at briefly describing the most accessible tools. To this end, we shall describe the working tools from a historical and functional perspective and then provide examples in which those tools can be or have already been used to obtain a more accurate transcript. The user can choose from the existing methods and tools according to his/her purposes, needs and means. A better understanding of technical data can reduce the working time, improve transcription, accelerate learning times and generally make an editor’s work much easier.

Highlights

  • Editors of Slavonic and Slavonic–Romanian text can make use of a large variety of tools for transcribing and digitizing these texts in a uniform manner

  • When using exclusively Unicode-compliant fonts, we ensure that our text stores exactly the characters we intended to transcribe in a standardized manner, which is independent of the font, device, software or operating system that our potential readers might use

  • The dative clitic is generally already fused with the noun in the stage of the first Romanian texts, yet some graphic particularities suggest the existence of a previous stage, between the one described by Coteanu and the one belonging to the current norm, where the demonstrative pronoun was not yet completely grammaticalized and had not lost its accent

Read more

Summary

Standardization of Cyrillic and Slavonic characters

Standardization for the Cyrillic alphabet was gradually implemented in various stages, starting with the year 1990. The modern Cyrillic alphabets were coded in a compact, basic block, which was assigned numbers between 1024 and 1279. The Cyrillic character A was assigned the number 1040, Cyrillic Б – 1041, and so on. The specific characters belonging to the Cyrillic alphabet used for Mordvinic, Azeri, Chuvash and other such languages were coded in 2002. This first additional group which was assigned numbers between 1280 and 1327 was called Cyrillic Supplement. Extended A, B and C, Phonetic Extensions, Combining Half Marks, Glagolitic and Glagolitic Supplement, were created mainly for researchers and coded characters such as superscript letters, Iota, as well as Înea and Ge for the Romanian texts. His study helps us understand the evolution of the tools available today compared with those existing over a decade ago

Slavonic Keyboard Layouts
Windows
Unicode Fonts
What fonts can be used?
BukyVede
Characters that are not encoded in Unicode
Morphologic-semantic disambiguation
Ligatures
Spaces
Musical manuscripts
Conclusions
Findings
Referințe

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.