Feature Extraction for Token Based Word Alignment for Question Answering Systems

Lokesh Kumar Sharma ,Anubha Aggarwal,Namita Mittal

doi:10.13053/cys-22-4-3070

Abstract

Mapping between the source words and the target words in a set of parallel sentences are a crucial part of Question Answering (QA) systems. I fan accurate aligner is used in QA systems then the efficiency of these systems also gets increased. We purpose the aligner which despite using very less lexical resources gives very good results in terms of precision, recall and F1. Previous aligners either uses more lexical resources or uses very less lexical resources. Hence, we have used POS TAG and WordNet as lexical resources. But some words whose meaning we maynot know but these occur in a similar distributionand by observing their distribution these words aresimilar. Consider two sentences "Lambodar is theson of Parvati" and "Ganesha is the son of Parvati". Here we will not find the meaning of Lambodar and Ganesha in Wordnet but since they have similar distributions so they should be aligned. For these words, we used Distribution Similarity Feature in our word aligner. This distributional similarity helps our alignerin broader coverage of words. Previous aligners were having recall in the range of 75-86 but this aligner has recall in the range of 88.4-93.3. Similarly, Exact match of previous aligners was in the range of 21-35.3 but the proposed aligner’s exact match range is 46.1-58.6. Similarly F-measure and precision have also increased.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Feature Extraction for Token Based Word Alignment for Question Answering Systems

Abstract

Talk to us

Similar Papers

More From: Computación y Sistemas

Lead the way for us

Similar Papers

Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering
Massimo Esposito ... Hamido Fujita
Information Sciences | VOL. 514
Massimo Esposito, et. al.Massimo Esposito ... Hamido Fujita
03 Dec 2019
Information Sciences | VOL. 514

The Intel 80386 and new 32-bit microprocessors: Tabak, DMicroproc. Microprog. Vol 19 No 1 (January 1987) pp 59–74
-
Microprocessors and Microsystems | VOL. 11
--
01 May 1987
The Intel 80386 and new 32-bit microprocessors: Tabak, DMicroproc. Microprog. Vol 19 No 1 (January 1987) pp 59–74
-

An Ontology-Driven Question Answering System For Computer Network Module
M I M Nowshad ... U U Samantha Rajapaksha
-
M I M Nowshad, et. al.M I M Nowshad ... U U Samantha Rajapaksha
02 Dec 2021
02 Dec 2021

A Study of Deep Learning for Factoid Question Answering System
Min-Yuh Day ... Yu-Ling Kuo
-
Min-Yuh Day, et. al.Min-Yuh Day ... Yu-Ling Kuo
01 Aug 2020
01 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature Extraction for Token Based Word Alignment for Question Answering Systems

Abstract

Talk to us

Similar Papers

More From: Computación y Sistemas