Romanian Slavonic in Unicode: problems and solutions

Ion-Mihai Felea

doi:10.17684/i14a198en

Abstract

Editors of Slavonic and Slavonic–Romanian texts can draw on a wide variety of tools (fonts, physical and virtual keyboard layouts, word processors, operating systems) to transcribe and digitize these texts in a uniform manner. The uniformity of the transcripts rests on Unicode standardization. Our study aims to explain the place of Slavonic in Unicode and to describe briefly the most accessible tools. To this end, we describe the working tools from a historical and functional perspective and then provide examples in which those tools can be, or already have been, used to obtain a more accurate transcript. Users can choose among the existing methods and tools according to their purposes, needs and means. A better understanding of the technical data can reduce working time, improve transcription, shorten learning time and generally make an editor's work much easier.

Highlights

Editors of Slavonic and Slavonic–Romanian texts can draw on a wide variety of tools to transcribe and digitize these texts in a uniform manner
When using exclusively Unicode-compliant fonts, we ensure that our text stores exactly the characters we intended to transcribe in a standardized manner, which is independent of the font, device, software or operating system that our potential readers might use

Summary

Standardization of Cyrillic and Slavonic characters

Standardization of the Cyrillic alphabet was implemented gradually, in several stages, starting in 1990. The modern Cyrillic alphabets were encoded in a compact basic block, which was assigned the numbers 1024 to 1279 (U+0400 to U+04FF). The Cyrillic character А was assigned the number 1040, Cyrillic Б the number 1041, and so on. The characters specific to the Cyrillic alphabets used for Mordvinic, Azeri, Chuvash and other such languages were encoded in 2002. This first additional group, which was assigned the numbers 1280 to 1327, was called Cyrillic Supplement. Further blocks (Cyrillic Extended-A, -B and -C, Phonetic Extensions, Combining Half Marks, Glagolitic and Glagolitic Supplement) were created mainly for researchers and encode characters such as superscript letters and Iota, as well as Înea and Ge for the Romanian texts. His study helps us understand the evolution of the tools available today compared with those existing over a decade ago.
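The numbering described above can be checked directly. The following is a minimal sketch in Python, not part of the original study: the `BLOCKS` table and the helper names `block_of` and `describe` are illustrative assumptions, and only the two block ranges quoted in the text are included.

```python
import unicodedata

# Block ranges exactly as quoted in the text (decimal code points, inclusive).
# Only these two blocks are listed; the table is illustrative, not complete.
BLOCKS = {
    "Cyrillic": (1024, 1279),             # U+0400 to U+04FF
    "Cyrillic Supplement": (1280, 1327),  # U+0500 to U+052F
}


def block_of(ch):
    """Return the name of the listed block containing ch, or None."""
    cp = ord(ch)
    for name, (low, high) in BLOCKS.items():
        if low <= cp <= high:
            return name
    return None


def describe(ch):
    """Format a character as hexadecimal code point, decimal number and Unicode name."""
    cp = ord(ch)
    return f"U+{cp:04X} ({cp}) {unicodedata.name(ch, '<unnamed>')}"


if __name__ == "__main__":
    # Escapes are used so the Cyrillic letters cannot be confused with Latin look-alikes.
    for ch in ("\u0410", "\u0411"):  # Cyrillic А and Б
        print(describe(ch), "in block", block_of(ch))
    # The Latin capital A looks identical to Cyrillic А but lies outside both blocks.
    print(describe("A"), "in block", block_of("A"))
```

The last line illustrates why the code point, rather than the glyph shape, identifies a character: Latin A and Cyrillic А may be drawn identically by a font, yet they are stored as different numbers.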

Slavonic Keyboard Layouts

Windows

Unicode Fonts

What fonts can be used?

BukyVede

Characters that are not encoded in Unicode

Morphologic-semantic disambiguation

Ligatures

Spaces

Musical manuscripts

Conclusions

Findings

References


Journal: Diacronia
Publication Date: Dec 12, 2021
License type: CC BY 4.0
