Xword: A Multi-lingual Framework for Expanding Words

Faisal Alshargi, Waseem Alromema, Saeedeh Shekarpour

doi:10.1007/978-3-030-33582-3_16

Abstract

Word expansion is useful in information retrieval and question answering systems: it alleviates the vocabulary mismatch problem and thereby leads to higher recall. Recent word embedding models have demonstrated merit for the word expansion task compared with traditional n-gram models. However, acquiring quality embeddings for each language requires corpus compilation, normalization, and parameter tuning, which are time-consuming and challenging, especially for low-resource languages such as Arabic. In this paper, we introduce Xword, an online multi-lingual framework for automatic word expansion. Xword relies on both pre-trained ad hoc word embedding models and n-gram models for the expansion task. Xword currently supports two languages, Arabic and German. Xword presents the results of each model both individually and collectively. Additionally, Xword can filter the result set based on the sentiment and part-of-speech (POS) tag of each word. Xword is available as a Web API, along with downloadable models and documentation, on our public GitHub.
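The idea of combining an embedding-based expander with an n-gram-based one, and then filtering by POS tag, can be illustrated with a minimal sketch. This is not Xword's implementation or its Web API: the vectors, the POS lexicon, the corpus, and every function name below are hypothetical and chosen only to keep the example self-contained. A sentiment filter could be added in the same way as the POS filter.

```python
import math
from collections import Counter

# Toy vectors invented for illustration; they are NOT taken from Xword's models.
EMBEDDINGS = {
    "car": [0.9, 0.1, 0.0],
    "automobile": [0.85, 0.15, 0.05],
    "vehicle": [0.8, 0.2, 0.1],
    "banana": [0.0, 0.9, 0.4],
    "drive": [0.6, 0.1, 0.7],
}

# Hypothetical POS lexicon standing in for a real tagger.
POS = {
    "car": "NOUN", "automobile": "NOUN", "vehicle": "NOUN",
    "banana": "NOUN", "drive": "VERB",
}

# Tiny hypothetical corpus for the n-gram side.
CORPUS = "the red car stopped . the red vehicle stopped . the red banana fell".split()


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def expand_by_embedding(word, k=3):
    """Return the k vocabulary words closest to `word` by cosine similarity."""
    target = EMBEDDINGS[word]
    scored = [(other, cosine(target, vec))
              for other, vec in EMBEDDINGS.items() if other != word]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scored[:k]]


def expand_by_ngram(word, tokens):
    """Return words that occur in the same (left, right) trigram context as `word`."""
    inner = range(1, len(tokens) - 1)
    contexts = {(tokens[i - 1], tokens[i + 1]) for i in inner if tokens[i] == word}
    counts = Counter(
        tokens[i] for i in inner
        if tokens[i] != word and (tokens[i - 1], tokens[i + 1]) in contexts
    )
    return [other for other, _ in counts.most_common()]


def expand(word, pos=None, k=3):
    """Report each model's candidates individually plus a combined, optionally POS-filtered list."""
    results = {
        "embedding": expand_by_embedding(word, k),
        "ngram": expand_by_ngram(word, CORPUS),
    }
    combined = []
    for candidates in (results["embedding"], results["ngram"]):
        for candidate in candidates:
            if candidate not in combined:
                combined.append(candidate)
    if pos is not None:
        combined = [c for c in combined if POS.get(c) == pos]
    results["combined"] = combined
    return results
```

Calling `expand("car", pos="NOUN")` returns the per-model lists under `"embedding"` and `"ngram"` and the merged, noun-only list under `"combined"`, mirroring the individual and collective views described in the abstract.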

Similar Papers

Analysis of Public Perceptions Towards the COVID-19 Vaccination Drive: A Case Study of Tweets with Machine Learning Classifiers
Koushal Kumar ... Bhagwati Prasad Pande
01 Jan 2021

A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art
Juan J Lastra-Díaz ... Eneko Agirre
Engineering Applications of Artificial Intelligence | VOL. 85
01 Aug 2019

A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection
Hanqing Zhou ... Amal Zouaq
Information | VOL. 10
25 Dec 2018

Rule Based Part of Speech Tagger for Arabic Question Answering System
Samah Ali Al-Azani ... C Namrata Mahender
01 Jan 2020
