A survey on extraction of causal relations from natural language text

Jie Yang,Soyeon Caren Han,Josiah Poon

doi:10.1007/s10115-022-01665-w

Jie Yang, Soyeon Caren Han + Show 1 more

Open Access

https://doi.org/10.1007/s10115-022-01665-w

Copy DOI

Journal: Knowledge and Information Systems	Publication Date: Mar 12, 2022
Citations: 23	License type: open-access

Affiliation: University of Sydney

Abstract

As an essential component of human cognition, cause–effect relations appear frequently in text, and curating cause–effect relations from text helps in building causal networks for predictive tasks. Existing causality extraction techniques include knowledge-based, statistical machine learning (ML)-based, and deep learning-based approaches. Each method has its advantages and weaknesses. For example, knowledge-based methods are understandable but require extensive manual domain knowledge and have poor cross-domain applicability. Statistical machine learning methods are more automated because of natural language processing (NLP) toolkits. However, feature engineering is labor-intensive, and toolkits may lead to error propagation. In the past few years, deep learning techniques attract substantial attention from NLP researchers because of its powerful representation learning ability and the rapid increase in computational resources. Their limitations include high computational costs and a lack of adequate annotated training data. In this paper, we conduct a comprehensive survey of causality extraction. We initially introduce primary forms existing in the causality extraction: explicit intra-sentential causality, implicit causality, and inter-sentential causality. Next, we list benchmark datasets and modeling assessment methods for causal relation extraction. Then, we present a structured overview of the three techniques with their representative systems. Lastly, we highlight existing open challenges with their potential directions.

Highlights

With the rapid growth of unstructured texts online, information extraction (IE) plays a vital role in natural language processing (NLP) research
Based on the assumption that dependency paths between cause and effect can be viewed as background knowledge, they use a wide range of such paths, regardless of whether cause and effect appear within one sentence or in adjacent sentences, taking web texts as extra input
Causal relations in natural language text play a key role in clinical decision-making, biomedical knowledge discovery, emergency management, news topic references, etc

Summary

Introduction

With the rapid growth of unstructured texts online, information extraction (IE) plays a vital role in NLP research. RE refers to extracted and classified semantic relationships, such as whole–part, product–producer, and cause–effect from text. The critical issues of whether a disease is the reason for a symptom depend on if there are cause–effect relation between them Extracting such kinds of causal relations from the medical literature can support constructing a knowledge graph, which can assist doctors in quickly finding causality, like diseases-cause-symptoms, diseases-bring-complications, treatments-improveconditions, and customize treatment plans. The task of CE focuses on developing systems for identifying cause–effect relations between pairs of labeled nouns from text [5]. CE studies can be classified in terms of different representation patterns: explicit or implicit causality, intra- or inter-sentential causality. Causality in many texts is implicit and/or inter-sentential conditions, which are more complicated than basic kinds of causality.

Previous surveys

Benchmark datasets

Balanced Related works

Evaluation metrics

Knowledge-based approaches

Explicit intra-sentential causality

Implicit causality

Inter-sentential causality

Statistical machine learning-based approaches

Explicit Intra-sentential causality

Deep learning-based approaches

Systems summary

Open problems and future directions

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A survey on extraction of causal relations from natural language text

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Knowledge and Information Systems

Lead the way for us

Similar Papers

Special issue on statistical learning of natural language structured input and output
Lluís Màrquez ... Alessandro Moschitti
Natural Language Engineering | VOL. 18
Lluís Màrquez, et. al.Lluís Màrquez ... Alessandro Moschitti
14 Mar 2012
Natural Language Engineering | VOL. 18

Natural Language Processing and Computational Linguistics
Junichi Tsujii
Computational Linguistics | VOL. -
Junichi TsujiiJunichi Tsujii
07 Dec 2021
Computational Linguistics | VOL. -

From semantics to pragmatics: where IS can lead in Natural Language Processing (NLP) research
Yan Li ... Dapeng Liu
European Journal of Information Systems | VOL. 30
Yan Li, et. al.Yan Li ... Dapeng Liu
24 Sep 2020
European Journal of Information Systems | VOL. 30

A Comparison of Three Machine Learning Methods for Multivariate Genomic Prediction Using the Sparse Kernels Method (SKM) Library.
Osval A Montesinos-López ... Bernabe Cano-Paez
Genes | VOL. 13
Osval A Montesinos-López, et. al.Osval A Montesinos-López ... Bernabe Cano-Paez
21 Aug 2022
Genes | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A survey on extraction of causal relations from natural language text

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Knowledge and Information Systems