EusDisParser: improving an under-resourced discourse parser with cross-lingual data

Mikel Iruskieta,Chloé Braud

doi:10.18653/v1/w19-2709

Abstract

Development of discourse parsers to annotate the relational discourse structure of a text is crucial for many downstream tasks. However, most of the existing work focuses on English, assuming a quite large dataset. Discourse data have been annotated for Basque, but training a system on these data is challenging since the corpus is very small. In this paper, we create the first demonstrator based on RST for Basque, and we investigate the use of data in another language to improve the performance of a Basque discourse parser. More precisely, we build a monolingual system using the small set of data available and investigate the use of multilingual word embeddings to train a system for Basque using data annotated for another language. We found that our approach to building a system limited to the small set of data available for Basque allowed us to get an improvement over previous approaches making use of many data annotated in other languages. At best, we get 34.78 in F1 for the full discourse structure. More data annotation is necessary in order to improve the results obtained with these techniques. We also describe which relations match with the gold standard, in order to understand these results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

EusDisParser: improving an under-resourced discourse parser with cross-lingual data

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2019
Citations: 34	License type: other-oa

Similar Papers

Representation learning in discourse parsing: A survey
Wei Song ... Lizhen Liu
Science China Technological Sciences | VOL. 63
Wei Song, et. al.Wei Song ... Lizhen Liu
16 Sep 2020
Science China Technological Sciences | VOL. 63

Spectral Semi-Supervised Discourse Relation Classification
Robert Fisher ... Reid Simmons
-
Robert Fisher, et. al.Robert Fisher ... Reid Simmons
01 Jan 2015
01 Jan 2015

HILDA: A Discourse Parser Using Support Vector Machine Classification
Hugo Hernault ... Mitsuru Ishizuka
Dialogue & Discourse | VOL. 1
Hugo Hernault, et. al.Hugo Hernault ... Mitsuru Ishizuka
10 Dec 2010
Dialogue & Discourse | VOL. 1

Discourse Parsing, Automatic
D Marcu
-
D MarcuD Marcu
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

EusDisParser: improving an under-resourced discourse parser with cross-lingual data

Abstract

Talk to us

Similar Papers