Speech Corpus Research Articles

The article formulates the subject and tasks of the future speech-genre discipline – the integral description of speech genres – which, in the author’s opinion, claims to be central to the general theory of speech genres. The main sources of the theory of speech genres integrality include the high level of a speech genre as a speech and language unit, the complex, multidimensional and multicomponent nature of the theory of speech genres, the multidisciplinarity of the theory of speech genres (it draws on data from almost all the humanities and partly non-humanities). The integral description of speech genres will include the following parameters: the cultural and historical context of the speech genre; typological characteristics of speech genre (including their place in various typologies); speech-genre variantology and recurrence, including new technogenic and Internet derivatives from primary, traditional genres; representation in the corps. The integral description of speech genres and its parameters are discussed in connection with the division of linguistics (partly – other sciences, for example, literary criticism). It is shown that the majority of divisions of linguistics that are significant for the integral description of speech genres go back to Saussure’s idea of opposing internal and external linguistics, and the main tasks of the theory of speech genres correspond to those three “eternal” problems of linguistics formulated by V. M. Alpatov: “How is language structured?”, “How does language function?” and “How does language develop?” The author discusses directions of the most important genre studies, from which the integral description of speech genres should be “composed”, and the criteria used for their selection: 1) the model is already integral (multi-component and interdisciplinary); 2) universal (applicable to many languages and national cultures, in different historical periods); 3) based on this model, the largest number of studies was carried out. The first criterion is met by the model of T. V. Shmeleva, the second – by the universalist model of A. Wierzbicka, the third – by traditional speech-genre models: lexical, syntactic and pragmatic. In conclusion, the author discusses the unresolved problems of speech genres, which can be logically solved on the basis of the integral description of speech genres: typology (integral bases for the classification of speech genres claim to be more reliable), variantology, applied and experimental aspects (vocabulary representation of speech genres and corpus aspect, including the analysis of key phrases of speech genres).

Read full abstract

Purpose: This research has several objectives. First, determine lexical density and compare the lexical density. Second, to determine the key lexical density and compare the key lexical density. Third, to test the independence of the relationship between lexical variations and the text of President Joko Widodo's and President Susilo Bambang Yudhoyono's speeches. Theoretical Reference: The theoretical basis used in this research is the lexical analysis approach in linguistics. The application of lexical perspective analysis is expected to be able to review the communication used by each individual. The theoretical lexical discussion will also use a statistical independence analysis approach. The application of a statistical independence analysis approach is used to review a person's individual language abilities. Method: This research uses a qualitative and quantitative corpus linguistics approach. The corpus linguistic application used in this research is the KORTARA application (Korpus Nusantara). The research data is a corpus of 9 texts of President Joko Widodo's speeches and a corpus of 9 texts of President Susilo Bambang Yudhoyono which are official speeches every 16 August before the DPR of the Republic of Indonesia. Results and Conclusion: The results of this research reveal that the text corpus of President Joko Widodo's speech is richer and more varied than the text corpus of President Susilo Bambang Yudhoyono's speech in lexical use. This research also revealed that there is a relationship between lexical variation and the type of text of the President of the Republic of Indonesia's speech with a confidence level of 95%. The difference in lexical variation and frequency between the text corpus of President Joko Widodo's speech and the text corpus of Susilo Bambang Yudhoyono's speech is statistically significant at p < 0.05. Implication of Research: The implication of this research is the realization of the KORTARA corpus linguistic approach (Korpus Nusantara) which can facilitate research for small and large scale data. This research also reveals that the application of a statistical approach provides maximum results in the analysis of large-scale linguistic phenomena. Originality/value: The current study makes a valuable empirical contribution by combining statistical analysis using corpus and qualititative analysis to give comprehensive conclusion. This study is the answer toward the question about the reliability and validity of linguistic studies.

Read full abstract

Speech Corpus Research Articles

Related Topics

Articles published on Speech Corpus

Comparison of Psychometric Functions Measured Using Remote Testing and Laboratory Testing.

Creation of a diverse mixed-lingual emotional speech corpus with a framework for enhanced emotion detection

Customized deep learning based Turkish automatic speech recognition system supported by language model.

Automatic speaker and age identification of children from raw speech using sincNet over ERB scale

Positional error distribution in Thai speakers’ acquisition of Korean stop consonants: A speech corpus analysis

Classification of Manners of the Prevocalic Alveolar Consonants in Machine Learning Using Dynamic Formant Transitions of Vowels in Korean Spontaneous Speech

Редукция на неударените гласни в съвременния български книжовен език: съпоставка на общоприети възгледи с корпусни данни

Disambiguation of Isolated Manipuri Tonal Contrast Word Pairs Using Acoustic Features

Classification patterns in conversational English fricatives: Between- and within- speaker analyses

Spectral properties of Quebec French sibilants

Modern Standard Arabic speech disorders corpus for digital speech processing applications

Location of constriction in velar sounds in French

The fundamental frequency (f0) distribution of American speakers in a spontaneous speech corpus*

Regional differences in the production of tones in standard mandarin

К проблеме интегрального описания речевых жанров

MECOS: A bilingual Manipuri–English spontaneous code-switching speech corpus for automatic speech recognition

COMPARISON OF THE SPEECH TEXTS OF INDONESIAN PRESIDENT JOKO WIDODO AND PRESIDENT SUSILO BAMBANG YUDHOYONO: STUDY USING A CORPUS LINGUISTIC APPROACH

Denoising Convolutional Autoencoder Based Approach for Disordered Speech Recognition

Evaluating the Efficacy of Traditional Machine Learning Models in Speaker Recognition: A Comparative Study Using the LibriSpeech Dataset

How Do Care Partners of People with Rare Dementia Use Language in Online Peer Support Groups? A Quantitative Text Analysis Study

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Speech Corpus Research Articles

Related Topics

Articles published on Speech Corpus

Comparison of Psychometric Functions Measured Using Remote Testing and Laboratory Testing.

Creation of a diverse mixed-lingual emotional speech corpus with a framework for enhanced emotion detection

Customized deep learning based Turkish automatic speech recognition system supported by language model.

Automatic speaker and age identification of children from raw speech using sincNet over ERB scale

Positional error distribution in Thai speakers’ acquisition of Korean stop consonants: A speech corpus analysis

Classification of Manners of the Prevocalic Alveolar Consonants in Machine Learning Using Dynamic Formant Transitions of Vowels in Korean Spontaneous Speech

Редукция на неударените гласни в съвременния български книжовен език: съпоставка на общоприети възгледи с корпусни данни

Disambiguation of Isolated Manipuri Tonal Contrast Word Pairs Using Acoustic Features

Classification patterns in conversational English fricatives: Between- and within- speaker analyses

Spectral properties of Quebec French sibilants

Modern Standard Arabic speech disorders corpus for digital speech processing applications

Location of constriction in velar sounds in French

The fundamental frequency (f0) distribution of American speakers in a spontaneous speech corpus*

Regional differences in the production of tones in standard mandarin

К проблеме интегрального описания речевых жанров

MECOS: A bilingual Manipuri–English spontaneous code-switching speech corpus for automatic speech recognition

COMPARISON OF THE SPEECH TEXTS OF INDONESIAN PRESIDENT JOKO WIDODO AND PRESIDENT SUSILO BAMBANG YUDHOYONO: STUDY USING A CORPUS LINGUISTIC APPROACH

Denoising Convolutional Autoencoder Based Approach for Disordered Speech Recognition

Evaluating the Efficacy of Traditional Machine Learning Models in Speaker Recognition: A Comparative Study Using the LibriSpeech Dataset

How Do Care Partners of People with Rare Dementia Use Language in Online Peer Support Groups? A Quantitative Text Analysis Study