Lexical Semantic Categorization Research Articles

The purpose of creating conservation areas is to protect endangered plant and animal species. Large, tagged linguistic corpora with a great variety of genres are used for the preservation and research of safe and endangered languages. The article describes the history, structure and development of the Open Corpus of the Veps and Karelian languages. The Veps language corpus was created in 2009 under the leadership of Nina Zaitseva. Three Karelian subcorpora (Karelian proper, Livvi and Ludian) were included in the linguistic corpus in 2016. The united linguistic platform was named “The Open Corpus of the Veps and Karelian languages” (VepKar). This linguistic corpus includes texts and dictionaries stored in a database, and a computer program (corpus manager) for searching and processing the data. This corpus manager was written in the PHP programming language in the Laravel framework. The data are stored in a MySQL database. Corpus and dictionaries data are available online (dictorpus.krc.karelia.ru). YouTube and Wikipedia are used by VepKar authors to popularize the corpus. Dictionaries and corpus texts are strongly interrelated. Multifunctional dictionaries of the Veps and Karelian languages contain definition, translation, dialect labels, semantic relations (synonyms, antonyms, etc.), examples of word usage with reference to texts, as well as complete inflectional paradigms. All texts are automatically marked up and there are references from words in the text to the corresponding meanings in the dictionary entries. The developers continue adding useful new features to the corpus manager to make the work of editors easier. For example, over the past three years, nominal and verbal inflection rules have been formulated and programmed for all dialects of the Veps language and its newly-written version, as well as for the Livvi-Karelian, North Karelian and Tver newly-written versions of the Karelian language. Thanks to this, 2.1 million word forms were generated in the VepKar system in a semi-automatic mode. The semantic markup in the corpus is 2.1 million links between words from the text and the meanings of lemmas in the dictionary. The grammatical markup was added, namely, 1.1 million links between words from the text and the grammatical features of word forms from the dictionary were automatically established. The multilingual VepKar corpus is divided into subcorpora according to languages and dialects, and the texts are also classified into styles and genres. The corpus has a sophisticated search system (with filtering of texts by language, style and dialect, by informant, collector or author, by year of recording or year of publication). It is possible to search for lemmas by dialects, parts of speech, grammatical features, and even by lexical-semantic categories. These categories appeared due to the integration of the data of the outstanding “Comparative and Onomasiological Dictionary of the Dialects of the Karelian, Veps and Sami Languages” into the vocabulary part of VepKar. In 2021, the Sanahelmi electronic dictionary was created on the basis of VepKar for Android phones. The development of mobile applications based on corpus data is our bright future.

Read full abstract

espanolEste articulo se basa en la teoria del caracter universal de la categoria de causalidad lexico-semantica y su caracter especifico. Teniendo en cuenta la ausencia de la categoria gramatical de la causacion en ambos idiomas, considero la posibilidad de que se compense con la categoria de formacion de palabras correspondiente comprendida por los verbos causativos derivados de la posicion. Los verbos causales posicionales se utilizan en este estudio como un termino general que se refiere a una clase de verbos causales que codifican semanticamente la postura corporal estatica o la posicion de los seres animados o la ubicacion estatica de los objetos inanimados en el espacio. La derivacion, la semantica y la pragmatica de los verbos causales derivados de la posicion en aleman y ucraniano se analizan para describir las relaciones locativas EnglishThis research is based on the universal theory of the lexical-seman-ticcategory of causation. Taking into ac-countthe absence of the grammatical category of causation in German and Ukrainian, I consider the possibility that it is compensated by the correspondingword-formation category comprised positional derived causative verbs. They semantically encode the change of the body position of animate beings as well as the location of inanimate objects in space. Although the positional verbs have been described in many languages, the semantic features of positional causative verbs have never been previously investigated in German and Ukrainian, which accounts for the novelty of the research. The derivation and semantics of positional derived causative verbs are analyzed to show how typologically different languages, on the one hand, have similarity in expressing the basic concepts of positional causative situation. But on the other hand, they differ in the spatial representation francaisCet article est base sur la theorie du caractere universel de la categoriede causalite lexico-semantique et de son caractere specifique. Compte tenu de l’absence de la categorie grammaticale de causalite dans les deux langues, j’envisage la possibilite qu’elle soit compensee par la categorie de formation de mots correspondante,constituee par les verbes causatifs derives positionnels.Les verbes causatifs de position sont utilises dans cette etude comme un terme generique qui designe une classe de verbes causatifs qui encodent semantique ment la positional corporelle statique ou la position des etres animes ou la position statique des objets inanimes dans l'espace. Bien que les verbes de position aient ete decrits dans de nombreuses langues, leurs caracteristiques semantiques n’ont jamais ete explorees auparavant en allemand et en ukrainien. La derivation et la semantique des verbes causatifs derives positionnels sont analysees pour montrer comment des langues typologiquement differentes, d'une part, ont une similitude dans l'expression des concepts de base de situation causative positionnelle. Mais d'un autre cote, ils different dans la representation spatiale

Read full abstract

Lexical Semantic Categorization Research Articles

Related Topics

Articles published on Lexical Semantic Categorization

Categorizing possession in Zuanga-Yuanga and other Kanak languages (New Caledonia): a typological perspective

TERMINOLOGICAL ANALYSIS IN ENGLISH AND UZBEK LINGUISTICS

Semantics and Ways of Expressing the Plural of Nouns in Russian and Japanese Languages

On determination of the part-of-speech affiliation of modal words

Buchnąć z futrówy, odpypić z ponsy – about vocabulary from the category „cheating” in student jargon from the turn of the 19th and 20th centuries

THE LINGUISTIC CORPUS VEPKAR IS A LANGUAGE REFUGE FOR THE BALTICFINNISH LANGUAGES OF KARELIA

Means of informal communication in political cartoons

Proper names from story recall are associated with beta-amyloid in cognitively unimpaired adults at risk for Alzheimer's disease

Positional verbs: derivation semantics and functioning

Miejsce słowotwórstwa w gramatyce pisanej według formuły „treść > forma”

Semantic (Ir)regularities in Action Nouns in Irish

Lexical Borrowing, Categorization, and Mental Representation

Representational Similarity Mapping of Distributional Semantics in Left Inferior Frontal, Middle Temporal, and Motor Cortex.

Noun Gender in Romanian, a Lexical-Semantic Category

Emotional arousal and lexical specificity modulate response times differently depending on ear of presentation in a dichotic listening task

Norma categorial para el español de Bogotá, Colombia.

Antonymy on historical aspect

Verbal fluency impairments among middle-aged and older outpatients with schizophrenia are characterized by deficient switching

De la "naturalité" des catégories sémantiques : des catégories d'objets naturels aux catégories lexicales

Deep dyslexia in a Dutch-speaking patient

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Lexical Semantic Categorization Research Articles

Related Topics

Articles published on Lexical Semantic Categorization

Categorizing possession in Zuanga-Yuanga and other Kanak languages (New Caledonia): a typological perspective

TERMINOLOGICAL ANALYSIS IN ENGLISH AND UZBEK LINGUISTICS

Semantics and Ways of Expressing the Plural of Nouns in Russian and Japanese Languages

On determination of the part-of-speech affiliation of modal words

Buchnąć z futrówy, odpypić z ponsy – about vocabulary from the category „cheating” in student jargon from the turn of the 19th and 20th centuries

THE LINGUISTIC CORPUS VEPKAR IS A LANGUAGE REFUGE FOR THE BALTICFINNISH LANGUAGES OF KARELIA

Means of informal communication in political cartoons

Proper names from story recall are associated with beta-amyloid in cognitively unimpaired adults at risk for Alzheimer's disease

Positional verbs: derivation semantics and functioning

Miejsce słowotwórstwa w gramatyce pisanej według formuły „treść &gt; forma”

Semantic (Ir)regularities in Action Nouns in Irish

Lexical Borrowing, Categorization, and Mental Representation

Representational Similarity Mapping of Distributional Semantics in Left Inferior Frontal, Middle Temporal, and Motor Cortex.

Noun Gender in Romanian, a Lexical-Semantic Category

Emotional arousal and lexical specificity modulate response times differently depending on ear of presentation in a dichotic listening task

Norma categorial para el español de Bogotá, Colombia.

Antonymy on historical aspect

Verbal fluency impairments among middle-aged and older outpatients with schizophrenia are characterized by deficient switching

De la "naturalité" des catégories sémantiques : des catégories d'objets naturels aux catégories lexicales

Deep dyslexia in a Dutch-speaking patient

Miejsce słowotwórstwa w gramatyce pisanej według formuły „treść > forma”