Лингводидактические свойства корпусных технологий

Pitirim Y Zolotov

doi:10.20310/1810-0201-2020-25-185-75-82

Abstract

For the last two decades, corpus technologies, understood as a combination of means and methods of processing and analyzing data of electronic linguistic corpora, as a type of information and communication technology, have attracted great interest of researchers and teachers of foreign languages.We explain the concepts of corpus linguistics, corpus technology, linguistic corpus, concordance. The methods of studying case technologies, which are an annotation, abstraction, and analysis, are considered. The advantages of linguistic corpora are given. The history of the emergence and development of linguistic electronic cases from the pre-digital to digital period is described. Minimum requirements for the corpus of texts are presented. They include representativeness, known volume of the corpus, electronic form, annotation and balance. We consider the typology of linguistic corpora. According to the language of the texts in corpora, there are monolingual and multilingual corpora, which in turn are divided into mixed and parallel ones. According to language data, there are written, oral and mixed corpora. Corpora can be annotated and non-annotated. There are three types of annotation: linguistic, metatextual, and extralinguistic. According to the parameter of representation of the language material of a corpus, there are fragmented and non-fragmented ones. According to the type of access, they are classified as open and restricted. According to the genre representation, linguistic corpora are diverse. The size of a corpus should distinguish between representative, illustrative and monitoring types of corpora. The didactic properties of corpus technologies in the field of teaching a foreign language are studied. The division of the linguodidactic properties of case technologies into mandatory and optional is proposed.

Full Text