A Canonical Context-Preserving Representation for Open IE: Extracting Semantically Typed Relational Tuples from Complex Sentences

Christina Niklaus, Matthias Cetto, André Freitas, Siegfried Handschuh

doi:10.1016/j.knosys.2023.110455


Open Access

https://doi.org/10.1016/j.knosys.2023.110455


Abstract


Modern systems that deal with inference in texts need automated methods to extract meaning representations (MRs) from texts at scale. Open Information Extraction (IE) is a prominent way of extracting all potential relations from a given text in a comprehensive manner. Previous work in this area has mainly focused on the extraction of isolated relational tuples. Because they ignore the cohesive nature of texts, where important contextual information is spread across clauses or sentences, state-of-the-art Open IE approaches are prone to generating a loose arrangement of tuples that lack the expressiveness needed to infer the true meaning of complex assertions.

To overcome this limitation, we present a method that allows existing Open IE systems to enrich their output with additional meta information. By leveraging the semantic hierarchy of minimal propositions generated by the discourse-aware Text Simplification (TS) approach presented in Niklaus et al. (2019), we propose a mechanism to extract semantically typed relational tuples from complex source sentences. Based on this novel type of output, we introduce a lightweight semantic representation for Open IE in the form of normalized and context-preserving relational tuples. It extends the shallow semantic representation of state-of-the-art approaches, which takes the form of predicate-argument structures, by capturing intra-sentential rhetorical structures and hierarchical relationships between the relational tuples. In that way, the semantic context of the extracted tuples is preserved, resulting in more informative and coherent predicate-argument structures that are easier to interpret.

In addition, in a comparative analysis, we show that the semantic hierarchy of minimal propositions benefits Open IE approaches in a second dimension: the canonical structure of the simplified sentences is easier to process and analyze, and thus facilitates the extraction of relational tuples, resulting in improved precision (up to 32%) and recall (up to 30%) of the extracted relations on a large benchmark corpus.
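
To make the idea of context-preserving relational tuples more concrete, the following is a minimal sketch in Python. It is not the authors' output format: the class `RelationalTuple`, its fields, the `context_layer` convention, the relation label `CONTRAST`, and the example sentence are all assumptions introduced here for illustration. It only shows how a predicate-argument structure might carry a hierarchy level and typed links to other tuples instead of standing in isolation.

```python
from dataclasses import dataclass, field


@dataclass
class RelationalTuple:
    """One minimal proposition as a predicate-argument structure (hypothetical schema)."""
    tuple_id: str
    subject: str
    predicate: str
    obj: str
    # Assumption: 0 marks a core statement, higher values mark contextual detail.
    context_layer: int = 0
    # (rhetorical_relation, target_tuple_id) pairs linking this tuple to others.
    links: list = field(default_factory=list)


def render(tuples):
    """Return one line per tuple, followed by its typed links, indented by layer."""
    lines = []
    for t in tuples:
        indent = "  " * t.context_layer
        lines.append(f"{indent}#{t.tuple_id} ({t.subject}; {t.predicate}; {t.obj})")
        for relation, target in t.links:
            lines.append(f"{indent}  {relation} -> #{target}")
    return "\n".join(lines)


# Invented example sentence (not from the paper):
# "Although the sensor failed, the robot finished the task."
tuples = [
    RelationalTuple("1", "the robot", "finished", "the task", 0, [("CONTRAST", "2")]),
    RelationalTuple("2", "the sensor", "failed", "", 1),
]
print(render(tuples))
```

In this sketch, the link from tuple 1 to tuple 2 keeps the rhetorical relation between the two clauses, and the layer value keeps their hierarchical relationship, which is the kind of information an isolated list of tuples would lose.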


Journal: Knowledge-Based Systems
Publication Date: Mar 15, 2023
License type: CC BY

Similar Papers

Integrating Local Context and Global Cohesiveness for Open Information Extraction
Qi Zhu ... Jiawei Han
30 Jan 2019

Open Information Extraction with Global Structure Constraints
Qi Zhu ... Yu Zhang
01 Jan 2018

In Layman’s Terms: Semi-Open Relation Extraction from Scientific Texts
Ruben Kruiper ... Jessica Chen-Burger
01 Jan 2020

Improving Open Information Extraction for Informal Web Documents with Ripple-Down Rules
Myung Hee Kim ... Paul Compton
01 Jan 2012
