Hybrid Distance-based, CNN and Bi-LSTM System for Dictionary Expansion

Béla Benedek Szakács,Tamás Mészáros

doi:10.36244/icj.2020.4.2

Abstract

Dictionaries like Wordnet can help in a variety of Natural Language Processing applications by providing additional morphological data. They can be used in Digital Humanities research, building knowledge graphs and other applications. Creating dictionaries from large corpora of texts written in a natural language is a task that has not been a primary focus of research, as other tasks have dominated the field (such as chat-bots), but it can be a very useful tool in analysing texts. Even in the case of contemporary texts, categorizing the words according to their dictionary entry is a complex task, and for less conventional texts (in old or less researched languages) it is even harder to solve this problem automatically. Our task was to create a software that helps in expanding a dictionary containing word forms and tagging unprocessed text. We used a manually created corpus for training and testing the model. We created a combination of Bidirectional Long-Short Term Memory networks, convolutional networks and a distancebased solution that outperformed other existing solutions. While manual post-processing for the tagged text is still needed, it significantly reduces the amount of it.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hybrid Distance-based, CNN and Bi-LSTM System for Dictionary Expansion

Abstract

Talk to us

Similar Papers

More From: Infocommunications journal

Lead the way for us

Similar Papers

Application of Deep Learning for Reservoir Porosity Prediction and Self Organizing Map for Lithofacies Prediction
Mazahir Hussain ... Umar Ashraf
Journal of Applied Geophysics | VOL. 230
Mazahir Hussain, et. al.Mazahir Hussain ... Umar Ashraf
31 Aug 2024
Journal of Applied Geophysics | VOL. 230

Automatic gear shift strategy for manual transmission of mine truck based on Bi-LSTM network
Liyong Wang ... Min Xie
Expert Systems With Applications | VOL. 209
Liyong Wang, et. al.Liyong Wang ... Min Xie
03 Aug 2022
Expert Systems With Applications | VOL. 209

Evaluation of Deep Learning Models for Multi-Step Ahead Time Series Prediction
Rohitash Chandra ... Rishabh Gupta
IEEE Access | VOL. 9
Rohitash Chandra, et. al.Rohitash Chandra ... Rishabh Gupta
01 Jan 2020
IEEE Access | VOL. 9

Photovoltaic Power Forecasting With an Ensemble Multi-Input Deep Learning Approach
Fariba Dehghan ... Mohsen Parsa Moghaddam
-
Fariba Dehghan, et. al.Fariba Dehghan ... Mohsen Parsa Moghaddam
08 Feb 2023
08 Feb 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid Distance-based, CNN and Bi-LSTM System for Dictionary Expansion

Abstract

Talk to us

Similar Papers

More From: Infocommunications journal