Deep Residual Learning for Weakly-Supervised Relation Extraction

Yiyao Huang,William Yang Wang

doi:10.18653/v1/d17-1191

Abstract

Deep residual learning (ResNet) is a new method for training very deep neural networks using identity mapping for shortcut connections. ResNet has won the ImageNet ILSVRC 2015 classification task, and achieved state-of-the-art performances in many computer vision tasks. However, the effect of residual learning on noisy natural language processing tasks is still not well understood. In this paper, we design a novel convolutional neural network (CNN) with residual learning, and investigate its impacts on the task of distantly supervised noisy relation extraction. In contradictory to popular beliefs that ResNet only works well for very deep networks, we found that even with 9 layers of CNNs, using identity mapping could significantly improve the performance for distantly-supervised relation extraction.

Highlights

Relation extraction is the task of predicting attributes and relations for entities in a sentence (Zelenko et al, 2003; Bunescu and Mooney, 2005; GuoDong et al, 2005)
We investigate the effects of training deeper convolutional neural network (CNN) for distantly-supervised relation extraction
In contrast to popular beliefs in vision that deep residual network only works for very deep CNNs, we show that even with a moderately deep CNNs, there are substantial improvements over vanilla CNNs for relation extraction

Summary

Introduction

Relation extraction is the task of predicting attributes and relations for entities in a sentence (Zelenko et al, 2003; Bunescu and Mooney, 2005; GuoDong et al, 2005). Among all the machine learning approaches for distant supervision, the recently proposed Convolutional Neural Networks (CNNs) model (Zeng et al, 2014) achieved the state-of-the-art performance. Following their success, Zeng et al (2015) proposed a piece-wise max-pooling strategy to improve the CNNs. Various attention strategies (Lin et al, 2016; Shen and Huang, 2016) for CNNs are proposed, obtaining impressive results. We show that our deep residual network model outperforms CNNs by a large margin empirically, obtaining state-of-the-art performances;. Our identity mapping with shortcut feedback approach can be applicable to any variants of CNNs for relation extraction

Vector Representation

Position Embeddings

Convolution

Residual Convolution Block

Experimental Settings

NYT-Freebase Dataset Performance

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Residual Learning for Weakly-Supervised Relation Extraction

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2017
Citations: 108	License type: cc-by

Similar Papers

Telugu handwritten character recognition using deep residual learning
Bindu Madhuri Cheekati ... Roje Spandana Rajeti
-
Bindu Madhuri Cheekati, et. al.Bindu Madhuri Cheekati ... Roje Spandana Rajeti
07 Oct 2020
07 Oct 2020

Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
Astrid A Zeman ... Hans Op De Beeck
Scientific reports | VOL. 10
Astrid A Zeman, et. al.Astrid A Zeman ... Hans Op De Beeck
12 Feb 2020
Scientific reports | VOL. 10

Deep Residual Learning for Facial Emotion Recognition
Sagar Mishra ... Duryodhan Chaulagain
-
Sagar Mishra, et. al.Sagar Mishra ... Duryodhan Chaulagain
23 Jul 2021
23 Jul 2021

Study on the Application of Improved Audio Recognition Technology Based on Deep Learning in Vocal Music Teaching
Nan Liu
Mathematical Problems in Engineering | VOL. 2022
Nan LiuNan Liu
18 Aug 2022
Mathematical Problems in Engineering | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Residual Learning for Weakly-Supervised Relation Extraction

Abstract

Highlights

Summary

Talk to us

Similar Papers