Linguistic Data Consortium Research Articles

Abstract We investigate how discourse relations and their subtypes are signalled, extending the set of discourse signals from connectives and lexical cue phrases to the wide range of semantic, syntactic, and orthographic signals of the RST Signalling Corpus (Das, Debopam & Maite Taboada. 2018. RST signalling corpus. Language Resources and Evaluation 52. 149–184). This extension requires re-evaluating previous predictions on discourse signalling, in particular, Sanders' causality-by-default hypothesis (Sanders, Ted. 2005. Coherence, causality and cognitive complexity in discourse. In M. Aurnague, M. Bras, A. Le Draoulec & L. Vieu (eds.), Proceedings/Actes SEM-05, first international symposium on the exploration and modelling of meaning, 105–114. Biarritz), the hypothesis of uniform information density (Frank, Austin & Florian Jaeger. 2008. Speaking rationally: Uniform information density as an optimal strategy for language production. In Proceedings of the 30th annual meeting of the Cognitive Science Society, 933–938. https://escholarship.org/uc/item/7d08h6j4 (accessed 18 May 2022)), and the hypothesis that discourse is continuous by preference (Segal, Erwin, Judith Duchan & Paula Scott. 1991. The role of interclausal connectives in narrative structuring. Discourse Processes 14. 27–54; Murray, John. 1997. Connectives and narrative text. Memory and Cognition 25. 227–236). We evaluate the predictions of these theories on the conditional relations in the RST Discourse Treebank (Carlson, Lynn, Daniel Marcu & Mary Ellen Okurowski. 2002. RST Discourse Treebank. LDC2002T07. Philadelphia: Linguistic Data Consortium), using causal relations as a control group. Informativity and continuity are operationalized in terms of semantic complexity and Givón's dimensions of deictic shift (Givón, Talmy. 1993. English grammar: A function-based introduction, vol. 2. Amsterdam: John Benjamins).
Our results show that the hypotheses make accurate predictions only for the relation groups in their entirety but not for the observed in-group variation, in particular, the low amount of marking for the hypothetical subtype of conditional relations. We attribute this difference to the distribution of intra- and inter-sentential occurrences across the conditional subtypes: intra-sentential relations are consistently more marked than inter-sentential ones, and hypothetical relations are special in that they appear predominantly inter-sententially.

Jaehyun Jo (조재현). 2018. The Stance and Subjectivity in the Use of the Korean Post-Verbal Negation -ci anh- in the Telephone Conversation. International Journal of Korean Language Education (국제한국어교육) 4(2), 81-106. This corpus-based study analyzes the stance and subjectivity expressed by the Korean negative structure ‘-ci anh-’ in naturally occurring telephone conversations. The Korean post-verbal (long-form) negation ‘-ci anh-’ and the pre-verbal (short-form) negation ‘an’ have traditionally been studied in syntax and semantics, with a focus on their interchangeability and the scope of each negation type. Contributing to more recent studies that attempt to identify the interactional functions of these negatives in spoken discourse, this study examines the functional and interactional characteristics of the ‘-ci anh-’ structure as it appears in the Linguistic Data Consortium (LDC) Call-Friend Korean corpus of 100 telephone calls. In the analyses, the utterances were coded with the following information: (1) what subject and predicate types appear with ‘-ci anh-’, in relation to how its subjectivity is established; (2) whether each ‘-ci anh-’ token is a simple negative or a rhetorical question (or statement) that shows the speaker’s specific stance; (3) with what kinds of ending combinations, and in which position of the speaker's turn, the post-verbal negation was deployed.
(University of California, Los Angeles)

Articles published on Linguistic Data Consortium

An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain

A Language Model Optimization Method for Turkish Automatic Speech Recognition System

Linguistic Data Consortium

Text-Independent Automatic Dialect Recognition of Marathi Language using Spectro-Temporal Characteristics of Voice

Towards developing speaker diarization for parent-child interactions

Signalling conditional relations

Research on an English translation method based on an improved transformer model

G-Cocktail: An Algorithm to Address Cocktail Party Problem of Gujarati Language Using Cat Boost

King Saud University Emotions Corpus: Construction, Analysis, Evaluation, and Comparison

Arabic speaker recognition system based on phoneme fusion

Building a Speech and Text Corpus of Turkish: Large Corpus Collection with Initial Speech Recognition Results

The Stance and Subjectivity in the Use of the Korean Post-Verbal Negation '-ci anh-' in the Telephone Conversation

Improving the performance of the speaker emotion recognition based on low dimension prosody features vector

An Enhanced Latent Semantic Analysis Approach for Arabic Document Summarization

Evaluation of an Arabic Speech Corpus of Emotions: A Perceptual and Statistical Analysis

An Event Relationship Model for Knowledge Organization and Visualization

Multi-lingual geoparsing based on machine translation

RST Signalling Corpus: a corpus of signals of coherence relations

Facial Expression Recognition with Faster R-CNN

Multistage data selection-based unsupervised speaker adaptation for personalized speech emotion recognition