While Large Language Models (LLMs) have significantly advanced many benchmarks in Natural Language Processing (NLP), low-resource tasks remain challenging, primarily because of data scarcity and the difficulty of annotation. Low-resource tasks are scenarios in which labeled data are extremely limited, making traditional supervised learning approaches impractical. This study introduces LoRE, a framework for zero-shot relation extraction in low-resource settings that combines distant supervision with the capabilities of LLMs. LoRE addresses the data sparsity and noise inherent in traditional distant supervision, enabling high-quality relation extraction without extensive labeled data. By leveraging LLMs for zero-shot open information extraction and incorporating heuristic entity and relation alignment with semantic disambiguation, LoRE improves the accuracy and relevance of the extracted data. The aim is a robust framework that not only tackles these challenges but also demonstrates the theoretical and practical implications of zero-shot relation extraction. The Chinese Person Relationship Extraction (CPRE) dataset, constructed with this framework, demonstrates LoRE's proficiency in extracting person-related triples; it consists of 1000 word pairs capturing diverse semantic relationships. Extensive experiments on the CPRE, IPRE, and DuIE datasets show significant improvements in dataset quality and a reduction in manual annotation effort. These findings highlight the potential of LoRE to advance both the theoretical understanding and practical applications of relation extraction in low-resource settings. Notably, LoRE's performance on the manually annotated DuIE dataset attests to the quality of the CPRE dataset, which rivals that of manually curated datasets, and underscores LoRE's potential to reduce the complexity and cost of dataset construction for zero-shot and low-resource tasks.