Abstract

Previous research on text-level discourse parsing has mainly made use of constituency structure to parse a whole document into one discourse tree. In this paper, we present the limitations of constituency-based discourse parsing and propose, for the first time, to use dependency structure to directly represent the relations between elementary discourse units (EDUs). State-of-the-art dependency parsing techniques, the Eisner algorithm and the maximum spanning tree (MST) algorithm, are adopted to parse an optimal discourse dependency tree based on the arc-factored model and large-margin learning techniques. Experiments show that our discourse dependency parsers achieve competitive performance on text-level discourse parsing.

Highlights

  • It is widely agreed that no unit of a text can be understood in isolation, but only in relation to its context

  • The rhetorical relations in Rhetorical Structure Theory (RST) trees are kept as the functional relations that link two Elementary Discourse Units (EDUs) in dependency trees

  • Following Feng and Hirst (2012), Lin et al. (2009), and Hernault et al. (2010b), we explore the following 6 feature types, combined with relations, to represent each labeled arc. (1) WORD: the first word, the last word, and the first bigram of each EDU, along with the pair of the first words and the pair of the last words of the two EDUs, are extracted as features
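The WORD feature type described above can be sketched as a simple extraction routine. This is an illustrative reconstruction, not the authors' code: the function name, the `H:`/`D:` prefixes, and the feature-string format are all hypothetical choices.

```python
def word_features(edu_head, edu_dep):
    """Sketch of the WORD feature type from the paper: the first word,
    last word, and first bigram of each EDU, plus the pair of first
    words and the pair of last words across the two EDUs.

    EDUs are given as token lists; prefixes H (head) and D (dependent)
    and the string templates are illustrative assumptions.
    """
    feats = []
    for tag, edu in (("H", edu_head), ("D", edu_dep)):
        feats.append(f"{tag}:first={edu[0]}")
        feats.append(f"{tag}:last={edu[-1]}")
        # first bigram; pad when the EDU has a single token
        second = edu[1] if len(edu) > 1 else "<none>"
        feats.append(f"{tag}:bigram={edu[0]}_{second}")
    feats.append(f"pair:first={edu_head[0]}_{edu_dep[0]}")
    feats.append(f"pair:last={edu_head[-1]}_{edu_dep[-1]}")
    return feats
```

In an arc-factored model, such features would be conjoined with the candidate relation label to score each labeled arc.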


Summary

Introduction

It is widely agreed that no unit of a text can be understood in isolation, but only in relation to its context. In Rhetorical Structure Theory (RST), the leaves of a discourse tree correspond to contiguous text spans called Elementary Discourse Units (EDUs). The different levels of discourse units (e.g., EDUs or larger text spans) occurring in the generative process are better represented with different features, so a uniform framework for discourse analysis is hard to develop. We instead adopt graph-based dependency parsing techniques learned from large sets of annotated dependency trees. The Eisner (1996) algorithm and the maximum spanning tree (MST) algorithm are used to parse the optimal projective and non-projective dependency trees respectively, with the large-margin learning technique of Crammer and Singer (2003).
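The Eisner algorithm mentioned above finds the highest-scoring projective dependency tree in O(n³) time under an arc-factored model, where the score of a tree is the sum of its arc scores. The following is a minimal sketch that returns only the best score (recovering the tree itself would additionally require backpointers); the function name and score-matrix convention are assumptions for illustration, with index 0 serving as an artificial root (here standing in for the root EDU of a document).

```python
def eisner_best_score(score):
    """Arc-factored Eisner algorithm (sketch).

    score[h][m] is the score of an arc from head h to modifier m;
    index 0 is an artificial root. Returns the score of the best
    projective dependency tree; invalid arcs can be set to -inf.
    """
    n = len(score)
    NEG = float("-inf")
    # chart[s][t][d][c]: best score of span (s, t);
    # d=1 -> head at s, d=0 -> head at t;
    # c=1 -> complete span, c=0 -> incomplete (still taking modifiers).
    chart = [[[[NEG, NEG], [NEG, NEG]] for _ in range(n)] for _ in range(n)]
    for s in range(n):
        for d in (0, 1):
            for c in (0, 1):
                chart[s][s][d][c] = 0.0
    for k in range(1, n):          # span width
        for s in range(n - k):
            t = s + k
            # incomplete spans: add arc s -> t (d=1) or t -> s (d=0)
            best = max(chart[s][r][1][1] + chart[r + 1][t][0][1]
                       for r in range(s, t))
            chart[s][t][0][0] = best + score[t][s]
            chart[s][t][1][0] = best + score[s][t]
            # complete spans: absorb an already-built incomplete span
            chart[s][t][0][1] = max(chart[s][r][0][1] + chart[r][t][0][0]
                                    for r in range(s, t))
            chart[s][t][1][1] = max(chart[s][r][1][0] + chart[r][t][1][1]
                                    for r in range(s + 1, t + 1))
    return chart[0][n - 1][1][1]
```

For non-projective trees, the MST (Chu-Liu/Edmonds) algorithm plays the analogous role, selecting the maximum spanning arborescence over the complete arc-score graph.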

Discourse Dependency Structure
Our Discourse Dependency Treebank
System Overview
Eisner Algorithm
Maximum Spanning Tree Algorithm
Learning
Features
MIRA based Learning
Preparation
Feature Influence on Two Relation Sets
Method Features
Comparison with Other Systems
Findings
Conclusions

