Abstract

Having access to large lexical and grammatical resources when creating a new language resource is essential for its enhancement and enrichment. This paper describes the interplay and interactive utilization of different language technology tools and resources, in particular the Swedish lexicon SALDO and the Swedish Constructicon, in the creation of Swedish FrameNet. We show how integrating resources in a larger infrastructure yields much more than the sum of the parts.

Highlights

  • This paper describes how Swedish language technology resources are exploited to construct Swedish FrameNet (SweFN), a lexical-semantic resource that has been expanded from and constructed in line with Berkeley FrameNet (BFN)

  • We have so far focused on three tasks: (1) semantic role labeling (SRL) (Johansson et al., 2012); (2) automatic sentence extraction, i.e. finding example sentences of varied syntactic and semantic complexity (Pilan et al., 2013); (3) automatic expansion of the SweFN lexicon, i.e. determining which frame a given word evokes by combining statistical and rule-based methods based on SALDO descriptors and information extracted from Korp (Johansson, 2014)

  • We have here focused on how two language technology resources, SALDO and Swedish Constructicon (SweCcn), are exploited in the development of SweFN, and on how these resources enhance each other and other resources

Summary

Introduction

This paper describes how Swedish language technology resources are exploited to construct Swedish FrameNet (SweFN), a lexical-semantic resource that has been expanded from and constructed in line with Berkeley FrameNet (BFN). To illustrate the notion of a semantic frame, consider the frame Vehicle landing. It has the following definition in BFN: “A flying VEHICLE comes to the ground at a GOAL in a controlled fashion, typically (but not necessarily) operated by an operator.” VEHICLE and GOAL are the core elements that, together with the description, uniquely characterize the frame. Their semantic types are Physical object and Location. The non-core elements of the frame are: CIRCUMSTANCES, COTHEME, DEGREE, DEPICTIVE, EVENT DESCRIPTION, FREQUENCY, GOAL CONDITIONS, MANNER, MEANS, MODE OF TRANSPORTATION, PATH, PERIOD OF ITERATIONS, PLACE, PURPOSE, RE ENCODING, SOURCE, and TIME. In contrast to the construction of English resources, as well as the construction of framenets for other languages, the resources used to construct SweFN are all linked in a unique infrastructure of language resources.
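The structure of a frame such as Vehicle landing can be sketched as a small data structure. The following is only an illustrative sketch: the class names `FrameElement` and `Frame` and their fields are hypothetical and do not reflect the actual BFN or SweFN data format. It also assumes that the two semantic types are listed in the same order as the core elements (VEHICLE as Physical object, GOAL as Location).

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class FrameElement:
    """A frame element; hypothetical representation, not the BFN/SweFN schema."""
    name: str
    core: bool
    semantic_type: Optional[str] = None  # unknown for the non-core elements here


@dataclass(frozen=True)
class Frame:
    """A semantic frame with its definition and frame elements."""
    name: str
    definition: str
    elements: Tuple[FrameElement, ...]

    def core_elements(self) -> List[str]:
        return [e.name for e in self.elements if e.core]

    def non_core_elements(self) -> List[str]:
        return [e.name for e in self.elements if not e.core]


# Non-core element names as listed in the text.
NON_CORE = [
    "CIRCUMSTANCES", "COTHEME", "DEGREE", "DEPICTIVE", "EVENT DESCRIPTION",
    "FREQUENCY", "GOAL CONDITIONS", "MANNER", "MEANS",
    "MODE OF TRANSPORTATION", "PATH", "PERIOD OF ITERATIONS", "PLACE",
    "PURPOSE", "RE ENCODING", "SOURCE", "TIME",
]

vehicle_landing = Frame(
    name="Vehicle landing",
    definition=(
        "A flying VEHICLE comes to the ground at a GOAL in a controlled "
        "fashion, typically (but not necessarily) operated by an operator."
    ),
    elements=(
        # Assumption: semantic types pair with core elements in listed order.
        FrameElement("VEHICLE", core=True, semantic_type="Physical object"),
        FrameElement("GOAL", core=True, semantic_type="Location"),
    ) + tuple(FrameElement(name, core=False) for name in NON_CORE),
)

print(vehicle_landing.core_elements())
print(len(vehicle_landing.non_core_elements()))
```

Separating core from non-core elements in this way mirrors the distinction drawn in the text: the core elements, together with the definition, characterize the frame, while the non-core elements are optional additions.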

The development of framenets in other languages
Lexical and grammatical resources and tools for Swedish
SALDO – association lexicon
Swedish Constructicon
Karp – open lexical infrastructure
Korp – Swedish corpora
The development of SweFN
Consistency checks and automatic extension of the data
Conclusions