Abstract

The availability of the Rhetorical Structure Theory (RST) Discourse Treebank has spurred substantial research into discourse analysis of written texts; however, limited research has been conducted to date on RST annotation and parsing of spoken language, in particular, non-native spontaneous speech. Considering that the measurement of discourse coherence is typically a key metric in human scoring rubrics for assessments of spoken language, we initiated a research effort to obtain RST annotations of a large number of non-native spoken responses from a standardized assessment of academic English proficiency. The resulting inter-annotator kappa agreements on the three different levels of Span, Nuclearity, and Relation are 0.848, 0.766, and 0.653, respectively. Furthermore, a set of features was explored to evaluate the discourse structure of non-native spontaneous speech based on these annotations; the highest-performing feature resulted in a correlation of 0.612 with scores of discourse coherence provided by expert human raters.
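The abstract does not show how these agreement figures were computed. As a minimal sketch (not the authors' code), assuming the two annotators' decisions at a given level have already been extracted and aligned into parallel label lists, Cohen's kappa can be computed with scikit-learn; the variable names and toy labels below are illustrative only.

```python
# Minimal sketch: inter-annotator agreement via Cohen's kappa.
# Assumes annotation decisions (e.g., Nuclearity labels) have been
# aligned into parallel lists; the labels are toy data, not the paper's.
from sklearn.metrics import cohen_kappa_score

annotator_a = ["nucleus", "satellite", "nucleus", "nucleus", "satellite"]
annotator_b = ["nucleus", "satellite", "satellite", "nucleus", "satellite"]

kappa = cohen_kappa_score(annotator_a, annotator_b)
print(f"kappa = {kappa:.3f}")  # agreement corrected for chance
```

For hierarchical RST trees, the nontrivial step is extracting comparable decisions from each annotator's tree before applying kappa; the sketch above abstracts that step away.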

Highlights

  • The spread of English as the global language of education and commerce is continuing, and there is a strong interest in developing assessment systems that can automatically score spontaneous speech from non-native speakers with the goals of reducing the burden on human raters, improving reliability, and generating feedback that can be used by language learners.

  • This study aims to construct a discourse-level annotation of non-native spontaneous speech in the framework of Rhetorical Structure Theory (RST) (Mann and Thompson, 1988), which can be used in automated discourse analysis and coherence measurement for non-native spoken responses, thereby improving the validity of automated scoring systems.

  • The availability of RST annotations on a selection of 385 Wall Street Journal articles from the Penn Treebank (Carlson et al., 2001) has facilitated RST-based discourse analysis of written texts, since it provides a standard benchmark for comparing the performance of different techniques for document-level discourse parsing (Joty et al., 2013; Feng and Hirst, 2014).
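The three annotation levels reported in the abstract (Span, Nuclearity, and Relation) correspond to decisions in an RST discourse tree: which elementary discourse units (EDUs) a node covers, whether each subtree acts as nucleus or satellite, and which rhetorical relation links siblings. As a minimal illustration (not the treebank's actual format), such a tree might be represented as below, recording the relation on the satellite relative to its nucleus sibling, one common convention; all field names are assumptions for illustration.

```python
# Minimal sketch of an RST discourse tree node illustrating the three
# annotation levels: Span, Nuclearity, and Relation. Illustrative only.
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class RSTNode:
    span: tuple                     # (first, last) EDU indices covered (Span level)
    nuclearity: str = "nucleus"     # "nucleus" or "satellite" (Nuclearity level)
    relation: Optional[str] = None  # relation to the sibling nucleus (Relation level)
    children: List["RSTNode"] = field(default_factory=list)

# Toy tree over two EDUs: EDU 2 elaborates on EDU 1.
tree = RSTNode(
    span=(1, 2),
    children=[
        RSTNode(span=(1, 1), nuclearity="nucleus"),
        RSTNode(span=(2, 2), nuclearity="satellite", relation="Elaboration"),
    ],
)
```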


Summary

Introduction

The spread of English as the global language of education and commerce is continuing, and there is a strong interest in developing assessment systems that can automatically score spontaneous speech from non-native speakers with the goals of reducing the burden on human raters, improving reliability, and generating feedback that can be used by language learners. Tonelli et al. adapted the PDTB annotation scheme to annotate discourse relations in spontaneous conversations in Italian (Tonelli et al., 2010), and Rehbein et al. compared two frameworks, PDTB and CCR (Cognitive approach to Coherence Relations) (Sanders et al., 1992), for the annotation of discourse relations in spoken language (Rehbein et al., 2016). In contrast to these previous studies, this study focuses on monologic spoken responses produced by non-native speakers within the context of a language proficiency assessment. Discourse coherence scores assigned by expert human raters are reused in the current study (along with holistic proficiency scores) to evaluate the performance of features measuring discourse coherence based on the RST annotations.
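This feature evaluation reduces to correlating per-response feature values with human coherence scores. A minimal sketch of that comparison, assuming the values are available as parallel lists and using scipy's pearsonr (the toy numbers below are not the paper's data):

```python
# Minimal sketch: correlating a discourse feature with human coherence
# scores, as in the 0.612 correlation reported in the abstract. Toy data.
from scipy.stats import pearsonr

feature_values = [0.42, 0.77, 0.31, 0.88, 0.55, 0.64]  # hypothetical per-response feature
human_scores = [2, 3, 1, 3, 2, 2]                      # hypothetical coherence ratings

r, p_value = pearsonr(feature_values, human_scores)
print(f"Pearson r = {r:.3f} (p = {p_value:.3f})")
```

Note that the abstract does not state which correlation coefficient was used; Pearson is assumed here.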

Guidelines
Pilot Annotation
Formal Annotation
Discourse Features
Conclusion and Future Work