Applying Semantic Role Labeling and Spreading Activation Techniques for Semantic Information Retrieval

Tomas Vileiniskis,Rita Butkiene

doi:10.5755/j01.itc.49.2.24985

Tomas Vileiniskis, Rita Butkiene

Open Access

https://doi.org/10.5755/j01.itc.49.2.24985

Copy DOI

Abstract

Semantically enhanced information retrieval (IR) is aimed at improving classical IR methods and goes way beyond plain Boolean keyword matching with the main goal of better serving implicit and ambiguous information needs. As a de-facto pre-requisite to semantic IR, different information extraction (IE) techniques are used to mine unstructured text for underlying knowledge. In this paper we present a method that combines both IE and IR to enable semantic search in natural language texts. First, we apply semantic role labeling (SRL) to automatically extract event-oriented information found in natural language texts to an RDF knowledge graph leveraging semantic web technology. Second, we investigate how a custom flavored graph traversal spreading activation algorithm can be employed to interpret user’s information needs on top of the prior-extracted knowledge base. Finally, we present an assessment on the applicability of our method for semantically enhanced IR. An experimental evaluation on partial WikiQA dataset shows the strengths of our approach and also unveils common pitfalls that we use as guidelines to draw further work directions in the open-domain semantic search field.

Highlights

In the context of traditional web search, information retrieval (IR) has been known as a task of obtaining documents relevant to user’s information needs, typically expressed by a form of a query
An evaluation of the proposed semantically enhanced IR method was conducted by firstly applying our semantic role labeling (SRL) Triple Extraction information extraction (IE) component on WikiQA [30] full dataset, and secondly by using corresponding query set to see how spreading activation algorithm behaves on top of the extracted Resource Description Framework (RDF) knowledge graph
The SRL annotator skips predicate “be.01” making it a drawback when dealing with factoid-like questions in WikiQA dataset as the required triples do not get asserted in the knowledge base during information extraction

Summary

Introduction

In the context of traditional web search, information retrieval (IR) has been known as a task of obtaining documents relevant to user’s information needs, typically expressed by a form of a query. As the target search space increases, more focus should be directed towards effective document content processing in order to distinguish between new and repeated knowledge sources We encounter another paradigm known as information extraction (IE). P2: [A0: YouTube] [V: operate.01] [A3: as a subsidiary of Google] Such structure represents shallow semantics of a sentence where each of the predicates is accompanied by its main (A0, A1, A2) and adjunctive arguments (AMTMP, AM-LOC, AM- MN). Since the natural ambiguity behind user’s information needs and information sources cannot be covered by solely relying on shallow predicate argument structures, deep semantic analysis of the resulting arguments is necessary to be carried out. The extracted knowledge is serialized using Resource Description Framework (RDF) resulting in a directed labeled knowledge graph Such representation further allows treating query execution as a graph traversal task.

Related Work

SRL Triple Ontology

Interpreting User’s Information Needs

Experimental Evaluation

Dataset Pre-processing

SRL Triple Extraction

Information Extraction Results

Information Retrieval Results

Comparison with TF-IDF

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information Technology And Control	Publication Date: Jun 16, 2020
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Applying Semantic Role Labeling and Spreading Activation Techniques for Semantic Information Retrieval

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information Technology And Control

Lead the way for us

Similar Papers

Research on Semantic Role Labeling based on Dependency Parsing
Jiahao Wang ... Ning Ma
-
Jiahao Wang, et. al.Jiahao Wang ... Ning Ma
20 Aug 2022
20 Aug 2022

An adaptable, high-performance relation extraction system for complex sentences
Anu Thomas ... Sangeetha Sivanesan
Knowledge-Based Systems | VOL. 251
Anu Thomas, et. al.Anu Thomas ... Sangeetha Sivanesan
07 May 2022
Knowledge-Based Systems | VOL. 251

Semantic-Enhanced Information Search and Retrieval
Wang Wei ... Andrzej Bargiela
-
Wang Wei, et. al.Wang Wei ... Andrzej Bargiela
01 Jan 2007
01 Jan 2007

Limitations of information extraction methods and techniques for heterogeneous unstructured big data
Kiran Adnan ... Rehan Akbar
International Journal of Engineering Business Management | VOL. 11
Kiran Adnan, et. al.Kiran Adnan ... Rehan Akbar
01 Jan 2019
International Journal of Engineering Business Management | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Applying Semantic Role Labeling and Spreading Activation Techniques for Semantic Information Retrieval

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information Technology And Control