Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing

Jiangming Liu,Shay B Cohen,Mirella Lapata

doi:10.18653/v1/2020.acl-main.416

Abstract

Discourse representation structures (DRSs) are scoped semantic representations for texts of arbitrary length. Evaluating the accuracy of predicted DRSs plays a key role in developing semantic parsers and improving their performance. DRSs are typically visualized as boxes which are not straightforward to process automatically. Counter transforms DRSs to clauses and measures clause overlap by searching for variable mappings between two DRSs. However, this metric is computationally costly (with respect to memory and CPU time) and does not scale with longer texts. We introduce Dscorer, an efficient new metric which converts box-style DRSs to graphs and then measures the overlap of n-grams. Experiments show that Dscorer computes accuracy scores that are correlated with Counter at a fraction of the time.

Highlights

Discourse Representation Theory (DRT) is a popular theory of meaning representation (Kamp, 1981; Kamp and Reyle, 2013; Asher, 1993; Asher et al, 2003) designed to account for a variety of linguistic phenomena within and across sentences
DSCORER, converts box-style Discourse Representation Structures (DRSs) to a graph format used for evaluation and computes F1 with high efficiency (7,000 times faster compared to COUNTER)
Due to the high sparsity of graphs typical for DRSs, the number of n-grams does not explode as the size of graphs increases, |G| = |N | + |E|, where |N | and |E| are the number of nodes and edges in graph G, respectively

Summary

Introduction

Discourse Representation Theory (DRT) is a popular theory of meaning representation (Kamp, 1981; Kamp and Reyle, 2013; Asher, 1993; Asher et al, 2003) designed to account for a variety of linguistic phenomena within and across sentences. This incurs unacceptable runtime, especially when evaluating document-level DRSs with a large number of variables Another problem with the current evaluation is that COUNTER only considers local clauses without taking larger window sizes into account. It considers “b4 sing e2” and “b3 NOT b4” as separate semantic units. In DRS parsing, there is a single correct semantic representation (goldstandard reference) and no limit on the maximum size of DRSs. Our proposed metric, DSCORER, converts box-style DRSs to a graph format used for evaluation and computes F1 with high efficiency (7,000 times faster compared to COUNTER). We release our code, implementing the metric, at https: //github.com/LeonCrashCode/DRSScorer

Graph Induction

Evaluation Based on n-grams

Experiments

Metric Settings

Analysis

Case Study

Related Work

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2020
Citations: 17	License type: cc-by

Similar Papers

Discourse representation structures for ACE 5
...
-
, et. al. ...
01 Jan 2006
01 Jan 2006

The First Shared Task on Discourse Representation Structure Parsing
Lasha Abzianidze ... Johan Bos
-
Lasha Abzianidze, et. al.Lasha Abzianidze ... Johan Bos
01 Jan 2019
01 Jan 2019

Processing Coordinated Structures in PENG Light
Rolf Schwitter
-
Rolf SchwitterRolf Schwitter
01 Jan 2010
01 Jan 2010

DRS at MRP 2020: Dressing up Discourse Representation Structures as Graphs
Lasha Abzianidze ... Stephan Oepen
-
Lasha Abzianidze, et. al.Lasha Abzianidze ... Stephan Oepen
01 Jan 2020
DRS at MRP 2020: Dressing up Discourse Representation Structures as Graphs
Lasha Abzianidze ... Stephan Oepen

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing

Abstract

Highlights

Summary

Talk to us

Similar Papers