Abstract

Deep learning methods are gaining popularity in many application domains, especially in natural language processing. It is commonly believed that, given a large enough dataset and an adequate network architecture, almost any processing problem can be solved. A frequent and widely used design is the encoder-decoder architecture, where an encoder transforms the input data into an intermediate code, and a decoder takes this code to produce its output. Different types of networks can be used in the encoder and the decoder, depending on the problem of interest, such as convolutional neural networks (CNN) or long short-term memory (LSTM) networks. This paper uses a recently proposed method, called the Causal Feature Extractor (CFE), for the encoder. It is based on causal convolutions (i.e., convolutions that depend on only one direction of the input), dilation (i.e., enlarging the aperture of the convolutions) and bidirectionality (i.e., independent networks in both directions). Preliminary results are presented on three different tasks and compared with state-of-the-art methods: bilingual translation, LaTeX decompilation and audio transcription. The proposed method achieves promising results, demonstrating its versatility in working with text, audio and images. Moreover, it trains faster, requiring less time per iteration, and makes good use of attention mechanisms, as reflected in the resulting attention matrices.
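The three ingredients of the CFE can be illustrated with a short sketch. This is a minimal, assumed implementation in PyTorch, not the authors' code: the class names (CausalConv1d, CausalFeatureExtractor), the number of layers and the doubling dilation schedule are illustrative choices only.

    import torch
    import torch.nn as nn

    class CausalConv1d(nn.Module):
        """1-D convolution whose output at position t sees only inputs <= t."""
        def __init__(self, channels, kernel_size, dilation=1):
            super().__init__()
            # Left-only padding makes the convolution causal.
            self.pad = (kernel_size - 1) * dilation
            self.conv = nn.Conv1d(channels, channels, kernel_size,
                                  dilation=dilation)

        def forward(self, x):                        # x: (batch, channels, time)
            x = nn.functional.pad(x, (self.pad, 0))  # pad on the left only
            return self.conv(x)

    class CausalFeatureExtractor(nn.Module):
        """Bidirectional stack of dilated causal convolutions (illustrative)."""
        def __init__(self, channels, kernel_size=3, layers=3):
            super().__init__()
            # Dilation doubles per layer (1, 2, 4, ...), widening the
            # receptive field without adding parameters.
            self.fwd = nn.ModuleList(
                CausalConv1d(channels, kernel_size, dilation=2 ** i)
                for i in range(layers))
            self.bwd = nn.ModuleList(
                CausalConv1d(channels, kernel_size, dilation=2 ** i)
                for i in range(layers))

        def forward(self, x):                        # x: (batch, channels, time)
            # Two independent networks, one per direction.
            f, b = x, torch.flip(x, dims=[2])
            for conv_f, conv_b in zip(self.fwd, self.bwd):
                f = torch.relu(conv_f(f))
                b = torch.relu(conv_b(b))
            # Re-reverse the backward stream and concatenate channel-wise.
            return torch.cat([f, torch.flip(b, dims=[2])], dim=1)

Applied to an embedded input sequence of shape (batch, channels, time), this encoder produces features of shape (batch, 2 * channels, time), which a decoder with attention can then consume.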

Highlights

  • We analyzed the feasibility of a novel type of encoder, the Causal Feature Extractor (CFE), as part of an encoder-decoder deep neural network, in different neural machine translation problems

Introduction

Deep neural networks (DNN) are going through a golden era, demonstrating great effectiveness and broad applicability across research areas, particularly in natural language processing (NLP) tasks. The existing approaches can be divided into recurrent and non-recurrent models; among the latter, the Transformer model by Vaswani et al. [2] achieved remarkable improvements by exploiting the idea of fully attention-based models. Other works, such as the ConvS2S model by Gehring et al. [3], address neural machine translation (NMT) problems with a fully convolutional approach, obtaining results comparable to the state of the art. This methodology has also been applied to speech recognition, as in the work by Kameoka et al. [4]. For the LaTeX decompilation problem, Deng et al. [6] proposed a convolutional solution using a hierarchical attention mechanism called coarse-to-fine, which produced significant improvements over previous systems with simpler attention models.
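As context for the attention-based systems cited above, the core operation they share is (scaled) dot-product attention over encoder features. The sketch below is generic, with assumed names and shapes; it is not drawn from any of the cited systems.

    import torch

    def attention(queries, keys, values):
        # queries: (batch, tgt_len, d); keys, values: (batch, src_len, d)
        d = queries.size(-1)
        scores = queries @ keys.transpose(1, 2) / d ** 0.5  # (batch, tgt, src)
        weights = torch.softmax(scores, dim=-1)             # attention matrix
        return weights @ values, weights                    # context, weights

The returned weights form the attention matrix, which is also what makes the behavior of such models inspectable: each row shows which source positions the decoder attended to for one output position.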
