Chinese Relation Extraction Using Extend Softword

Bo Kong,Fuyuan Wei,Guangyao Wang,Liruizhi Jia,Shengquan Liu

doi:10.1109/access.2021.3102225

Abstract

In recent years, many scholars have chosen to use word lexicons to incorporate word information into a model based on character input to improve the performance of Chinese relation extraction (RE). For example, Li <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">et al.</i> proposed the MG-Lattice model in 2019 and achieved state-of-the-art (SOTA) results. However, MG-Lattice still has the problem of information loss due to its model structure, which affects the performance of Chinese RE. This paper proposes an adaptive method to include word information at the embedding layer using a word lexicon to merge all words that match each character into a character input-based model to solve the information loss problem of MG-Lattice. The method can be combined with other general neural system networks and has transferability. Experimental studies on two benchmark Chinese RE datasets show that our method achieves an inference speed up to 12.9 times faster than the SOTA model, along with a better performance. The experimental results also show that this method combined with the BERT pretrained model can effectively supplement the information obtained from the pretrained model, further improving the performance of Chinese RE.

Highlights

R ELATION extraction (RE) is a subtask of information extraction, aiming to extract semantic relations between entity pairs in natural language sentences
Each character of the input sequence is mapped to a dense vector, and a dictionary matching method is used to introduce the word information and merge its weight into the character representation to add its vocabulary enhancement
Multiple standard evaluation metrics are applied in the experiments, including the precision, recall, F1-score and area under the curve (AUC)

Summary

Introduction

R ELATION extraction (RE) is a subtask of information extraction, aiming to extract semantic relations between entity pairs in natural language sentences. Unlike an English RE model, a Chinese RE model based on word input must first perform word segmentation because sentences in Chinese are not naturally segmented. Using a model based on word input will be affected by word segmentation performance. As shown, the Chinese sentence “武汉研究所有杜鹃(there are cuckoos in Wuhan institute)” has two entities, which are “武汉(Wuhan)” and “杜鹃(cuckoos)”. In this case, the correct segmentation is “武汉(Wuhan)/研究所(institute)/有(have)/杜鹃(cuckoos)”. If the sentence is divided into “武汉(Wuhan)/研究(studies)/所

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Chinese Relation Extraction Using Extend Softword

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Chinese satellite frequency and orbit entity relation extraction method based on dynamic integrated learning
Yuanzhi He ... Zhiqiang Li
Digital Communications and Networks | VOL. -
Yuanzhi He, et. al.Yuanzhi He ... Zhiqiang Li
01 May 2024
Digital Communications and Networks | VOL. -

Joint Extraction of Clinical Entities and Relations Using Multi-head Selection Method
Xintao Fang ... Yuting Song
-
Xintao Fang, et. al.Xintao Fang ... Yuting Song
11 Dec 2021
11 Dec 2021

Utilizing Entity-Based Gated Convolution and Multilevel Sentence Attention to Improve Distantly Supervised Relation Extraction.
Qian Yi ... Shuwu Zhang
Computational Intelligence and Neuroscience | VOL. 2021
Qian Yi, et. al.Qian Yi ... Shuwu Zhang
01 Jan 2020
Computational Intelligence and Neuroscience | VOL. 2021

Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors
Guozheng Li ... Yikai Guo
-
Guozheng Li, et. al.Guozheng Li ... Yikai Guo
01 Aug 2024
01 Aug 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Chinese Relation Extraction Using Extend Softword

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access