Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool.

Aitziber Atutxa,Arantza Diaz De Ilarraza,Mikel Iruskieta,Kepa Bengoetxea

doi:10.1371/journal.pone.0221639

Abstract

Lately, discourse structure has received considerable attention due to the benefits its application offers in several NLP tasks such as opinion mining, summarization, question answering, text simplification, among others. When automatically analyzing texts, discourse parsers typically perform two different tasks: i) identification of basic discourse units (text segmentation) ii) linking discourse units by means of discourse relations, building structures such as trees or graphs. The resulting discourse structures are, in general terms, accurate at intra-sentence discourse-level relations, however they fail to capture the correct inter-sentence relations. Detecting the main discourse unit (the Central Unit) is helpful for discourse analyzers (and also for manual annotation) in improving their results in rhetorical labeling. Bearing this in mind, we set out to build the first two steps of a discourse parser following a top-down strategy: i) to find discourse units, ii) to detect the Central Unit. The final step, i.e. assigning rhetorical relations, remains to be worked on in the immediate future. In accordance with this strategy, our paper presents a tool consisting of a discourse segmenter and an automatic Central Unit detector.

Highlights

Our linguistic understanding about how to exploit the discourse properties of a text has grown in many ways, as described by [1]
Some disagreements in relations are a consequence of a lack of agreements in the attachment locus which happens to be greater at inter-sentential level
With the future objective of developing a complete discourse parser, this work aims to build and evaluate automatic discourse segmentation and Central Unit detector based on neural networks, in order to use this partial parser in different NLP tasks: i) summarization [2], ii) complex question answering [3] iii) opinion mining [4] and sentiment analysis [5,6,7] iv) evaluation of scholars’ summaries [34]

Summary

Introduction

Our linguistic understanding about how to exploit the discourse properties of a text has grown in many ways, as described by [1]. Discourse parsing is a very challenging task and several authors have shown that discourse structure is crucial in obtaining a better understanding of texts. Exploiting discourse structure information adequately could be the key to improving different NLP tasks such as: i) summarization [2], ii) complex question answering [3] iii) opinion mining [4] and sentiment analysis [5,6,7]. Our approach to discourse here follows Rhetorical Structure Theory (RST) [8], a discourse theory that describes coherence of a text with rhetorical relations between text-spans forming a hierarchical discourse tree (RS-tree).

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Sep 4, 2019
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

Chinese Paragraph level Discourse Parsing with Global Backward and Local Reverse Reading

-

25 Nov 2020
25 Nov 2020

Chinese Paragraph-level Discourse Parsing with Global Backward and Local Reverse Reading
Feng Jiang ... Qiaoming Zhu
-
Feng Jiang, et. al.Feng Jiang ... Qiaoming Zhu
01 Jan 2020
01 Jan 2020

The function of discourse particles: A study with special reference to spoken standard French By M.-B. Mosegaard Hansen (review)
Suzanne Fleischman
Language | VOL. 75
Suzanne FleischmanSuzanne Fleischman
01 Mar 1999
Language | VOL. 75

Representation learning in discourse parsing: A survey
Wei Song ... Lizhen Liu
Science China Technological Sciences | VOL. 63
Wei Song, et. al.Wei Song ... Lizhen Liu
16 Sep 2020
Science China Technological Sciences | VOL. 63

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE