Automatic classification of Arabic dialects is a preliminary step toward many dialect-sensitive Arabic natural language processing tasks. Arabic dialect identification entails predicting the dialect associated with a given textual input and classifying it under its respective label. In this paper, we propose a novel approach that merges several distinct datasets to obtain a large, diverse, and bias-free dialectal corpus. Our dataset is a collection of parallel sentences translated into multiple dialects (MADAR), together with tweets gathered from Twitter users (NADI-2020, NADI-2021, QADI, and ARAP-Tweet 2.0). The collected dataset is classified into seven labels: Gulf, Levant, Iraq, Maghreb, Nile Basin, Yemen, and Modern Standard Arabic. The merged dataset was cleaned to produce Arabic sentences free of punctuation, non-Arabic characters, numbers, repeated characters, empty lines, and elongation characters. We obtained high dialectal classification accuracy using a new Word2vec embedding model trained on the merged dialectal dataset. Seven deep learning systems were trained on a balanced subset of the dataset: Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Bidirectional LSTM (BiLSTM), Bidirectional GRU (BiGRU), Convolutional LSTM (CLSTM), and Convolutional GRU (CGRU). Single-label classification tests run on these trained models achieved a minimum accuracy of 77.90% (CNN) and a maximum accuracy of 81.52% (BiLSTM). We further evaluated accuracy separately on short and long sentences, attaining 87.52% on short sentences with the CGRU and 94.06% on long sentences with the BiGRU, both of which indicate the efficacy of the proposed approach.
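The abstract describes the cleaning pipeline but does not list it; the snippet below is a minimal sketch of those steps in Python using regular expressions. The function name, the exact Unicode ranges, and the character-repetition threshold are illustrative assumptions, not the paper's implementation.

```python
import re

# Sketch of the cleaning steps described above: remove elongation (tatweel),
# punctuation, digits, non-Arabic characters, repeated characters, and
# empty/extra whitespace. Thresholds and ranges are assumed, not the paper's.
def clean_arabic(text: str) -> str:
    text = re.sub(r'\u0640+', '', text)               # drop tatweel/elongation (U+0640)
    text = re.sub(r'[^\u0621-\u064A\s]+', ' ', text)  # keep Arabic letters only
                                                      # (removes punctuation, digits,
                                                      #  Latin characters, emojis)
    text = re.sub(r'(.)\1{2,}', r'\1\1', text)        # squeeze runs of 3+ repeated chars
    return re.sub(r'\s+', ' ', text).strip()          # collapse whitespace/empty lines

# e.g. clean_arabic("واااااو!!! vote4me ١٢٣") -> "وااو"
```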
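Similarly, the embedding and classification stage could look like the sketch below, assuming gensim for the Word2vec model and Keras for the BiLSTM (one of the seven reported architectures, shown as a representative). All hyperparameters (vector size, window, sequence length, hidden units, dropout) and the `raw_lines` input are placeholders, not values reported in the paper.

```python
import numpy as np
from gensim.models import Word2Vec
from tensorflow.keras import layers, models

# Hypothetical input: raw_lines is the merged dialectal corpus, one sentence per line.
sentences = [clean_arabic(line).split() for line in raw_lines]

# Train a Word2vec embedding on the merged corpus
# (vector_size/window/min_count are assumed values).
w2v = Word2Vec(sentences, vector_size=300, window=5, min_count=2, workers=4)

# Build an embedding matrix; index 0 is reserved for padding.
vocab = w2v.wv.index_to_key
word_index = {w: i + 1 for i, w in enumerate(vocab)}
emb_matrix = np.zeros((len(vocab) + 1, 300))
for w, i in word_index.items():
    emb_matrix[i] = w2v.wv[w]

MAX_LEN = 50        # assumed maximum sequence length
NUM_CLASSES = 7     # Gulf, Levant, Iraq, Maghreb, Nile Basin, Yemen, MSA

# Representative BiLSTM classifier over the pretrained, frozen embeddings.
model = models.Sequential([
    layers.Embedding(len(vocab) + 1, 300, weights=[emb_matrix],
                     input_length=MAX_LEN, trainable=False),
    layers.Bidirectional(layers.LSTM(128)),
    layers.Dropout(0.3),
    layers.Dense(NUM_CLASSES, activation='softmax'),
])
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',  # integer labels 0..6
              metrics=['accuracy'])
```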