Abstract
For the last two decades, corpus technologies, understood as a combination of means and methods of processing and analyzing data of electronic linguistic corpora, as a type of information and communication technology, have attracted great interest of researchers and teachers of foreign languages.We explain the concepts of corpus linguistics, corpus technology, linguistic corpus, concordance. The methods of studying case technologies, which are an annotation, abstraction, and analysis, are considered. The advantages of linguistic corpora are given. The history of the emergence and development of linguistic electronic cases from the pre-digital to digital period is described. Minimum requirements for the corpus of texts are presented. They include representativeness, known volume of the corpus, electronic form, annotation and balance. We consider the typology of linguistic corpora. According to the language of the texts in corpora, there are monolingual and multilingual corpora, which in turn are divided into mixed and parallel ones. According to language data, there are written, oral and mixed corpora. Corpora can be annotated and non-annotated. There are three types of annotation: linguistic, metatextual, and extralinguistic. According to the parameter of representation of the language material of a corpus, there are fragmented and non-fragmented ones. According to the type of access, they are classified as open and restricted. According to the genre representation, linguistic corpora are diverse. The size of a corpus should distinguish between representative, illustrative and monitoring types of corpora. The didactic properties of corpus technologies in the field of teaching a foreign language are studied. The division of the linguodidactic properties of case technologies into mandatory and optional is proposed.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.