Generating Training Data for Semantic Role Labeling based on Label Transfer from Linked Lexical Resources

Silvana Hartmann,Iryna Gurevych,Judith Eckle-Kohler

doi:10.1162/tacl_a_00093

Abstract

We present a new approach for generating role-labeled training data using Linked Lexical Resources, i.e., integrated lexical resources that combine several resources (e.g., Word-Net, FrameNet, Wiktionary) by linking them on the sense or on the role level. Unlike resource-based supervision in relation extraction, we focus on complex linguistic annotations, more specifically FrameNet senses and roles. The automatically labeled training data ( www.ukp.tu-darmstadt.de/knowledge-based-srl/ ) are evaluated on four corpora from different domains for the tasks of word sense disambiguation and semantic role classification. Results show that classifiers trained on our generated data equal those resulting from a standard supervised setting.

Highlights

In this work, we present a novel approach to automatically generate training data for semantic role labeling (SRL)
Our novel approach to training data generation for FrameNet SRL uses the paradigm of distant supervision (Mintz et al, 2009) which has become popular in relation extraction
We show that discriminating patterns can improve the quality of the automatic sense labels. (ii) We use a distant supervision approach – building on lexical resources (LLRs) – to address the complex problem of training data generation for FrameNet role labeling, which builds upon the sense labeling in (i). (iii) Our detailed evaluation and analysis show that our approach for data generation is able to generalize across domains and languages

Summary

Introduction

We present a novel approach to automatically generate training data for semantic role labeling (SRL). It follows the distant supervision paradigm and performs knowledge-based label transfer from rich external knowledge sources to large corpora. Even though unsupervised approaches continue to gain popularity, SRL is typically still solved using supervised training on labeled data. Creating such labeled data requires manual annotations by experts,. Our novel approach to training data generation for FrameNet SRL uses the paradigm of distant supervision (Mintz et al, 2009) which has become popular in relation extraction. A particular type of knowledge base relevant for distant supervision are linked lexical resources (LLRs): integrated lexical resources that combine several resources (e.g., WordNet, FrameNet, Wiktionary) by linking them on the sense or on the role level

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Dec 1, 2016
Citations: 34	License type: cc-by

R Discovery Prime

R Discovery Prime

Generating Training Data for Semantic Role Labeling based on Label Transfer from Linked Lexical Resources

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Fast Semantic Role Labeling for Chinese Based on Semantic Chunking
... Baobao Chang
-
, et. al. ... Baobao Chang
01 Jan 2009
01 Jan 2009

Generative models for syntactic and semantic structure prediction using latent variables

-

01 Jan 2015
01 Jan 2015

Prediction of thematic rank for structured semantic role labeling
Weiwei Sun ... Zhifang Sui
-
Weiwei Sun, et. al.Weiwei Sun ... Zhifang Sui
01 Jan 2009
01 Jan 2009

Selectional Preferences for Semantic Role Classification
Beñat Zapirain ... Lluís Màrquez
Computational Linguistics | VOL. 39
Beñat Zapirain, et. al.Beñat Zapirain ... Lluís Màrquez
01 Sep 2013
Computational Linguistics | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generating Training Data for Semantic Role Labeling based on Label Transfer from Linked Lexical Resources

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics