Attention-based Encoder-decoder Research Articles

A spelling error is a mistake made while typing a text document. Applications such as search engines, information retrieval systems, and email require user typing, and in such applications a good spell-checker is essential to correct misspellings. Spell-checkers for Western languages like English are very powerful and can handle any type of spelling error, whereas for Indian languages like Hindi, Urdu, Bengali, Kannada, and Assamese, the available spell-checkers are very basic. These spell-checkers are developed using traditional approaches such as statistical and rule-based methods. This article presents a novel model, HINDIA, to handle spelling errors in Hindi, one of the most widely spoken languages in India. It uses a deep-learning method for spelling error detection and correction. The proposed spell-checking model works in two phases. In the first phase, the model identifies the erroneous words in the input sample; in the second phase, it replaces the wrong words with the most probable correct words. HINDIA is built on an attention-based encoder–decoder bidirectional recurrent neural network (BiRNN) that uses long short-term memory cells. Several modifications have been made to the BiRNN, and the network is fine-tuned to process the spelling errors of the Hindi language. It uses the publicly available ‘monolingual corpus’ dataset developed by IIT Mumbai for training and testing. The performance of the proposed model is evaluated in two scenarios. In the first scenario, where the testing dataset is generated using a split function, HINDIA performs well, with precision 0.86, recall 0.72, F-measure 0.78, and accuracy 0.80. In the second scenario, where the dataset is manually generated, its performance is fairly good, with precision 0.81, recall 0.72, F-measure 0.76, and accuracy 0.74. HINDIA performs better than the deep-learning-based Malayalam spell-checker and some other deep-learning-based correction models in the literature.
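The reported F-measure values can be checked against the reported precision and recall. The sketch below assumes the F-measure is the balanced F1 score (the harmonic mean of precision and recall), which the abstract does not state explicitly; the function name `f_measure` is illustrative and not taken from the article.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F1: harmonic mean of precision and recall (assumed definition)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract for the two scenarios.
split_f1 = f_measure(0.86, 0.72)   # test set generated with a split function
manual_f1 = f_measure(0.81, 0.72)  # manually generated test set

print(round(split_f1, 2), round(manual_f1, 2))  # prints: 0.78 0.76
```

Under that assumption, both results match the F-measure values given in the abstract.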


Recognition of historical documents is a challenging problem due to noisy, damaged characters and backgrounds. Japanese historical documents pose a further difficulty beyond these problems: pre-modern Japanese characters were written in cursive and are connected. Therefore, character segmentation-based methods do not work well, which motivates a new recognition system. In this paper, we propose a human-inspired document reading system to recognize multiple lines of pre-modern Japanese historical documents. During reading, people use eye movements to determine the start of a text line. Then, they move their eyes from the current character or word to the next one. They can also determine the end of a line or skip a figure to move to the next line. Eye movement integrates with visual processing to drive the reading process in the brain. We employ an attention-based encoder–decoder to implement this recognition system. First, the recognition system detects where to start a text line. Second, the system scans and recognizes character by character until the text line is completed. Then, the system continues to detect the start of the next text line. This process is repeated until the whole document has been read. As a result, the system successfully recognizes multiple lines and connected, cursive characters without performing character or line segmentation. In addition, we employ a coverage model that stores the history of eye movements to predict the next movement more precisely. We tested our human-inspired recognition system on the pre-modern Japanese historical documents provided by the PRMU Kuzushiji competition. The experimental results demonstrate the effectiveness of the proposed system, which achieves Sequence Error Rates of 9.87% and 53.81% on level 2 and level 3 of the dataset, respectively. These results outperform all other systems that participated in the PRMU Kuzushiji competition.
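The idea of attention with a coverage history can be illustrated with a toy example. This is a minimal sketch, not the authors' model: it assumes dot-product scoring and a simple subtractive coverage penalty, whereas the system described above learns these components. The names `attend`, `penalty`, `keys`, and `query` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, coverage, penalty=1.0):
    """One decoding step: score each encoder position by dot product,
    subtract a penalty proportional to the attention it already received,
    and return the new weights plus the updated coverage history."""
    scores = [
        sum(q * k for q, k in zip(query, key)) - penalty * c
        for key, c in zip(keys, coverage)
    ]
    weights = softmax(scores)
    new_coverage = [c + w for c, w in zip(coverage, weights)]
    return weights, new_coverage


# Three toy encoder positions (e.g. image regions) and one decoder query.
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 1.0]
coverage = [0.0, 0.0, 0.0]  # no position has been attended to yet

w1, coverage = attend(query, keys, coverage)
w2, coverage = attend(query, keys, coverage)
print([round(w, 3) for w in w1])
print([round(w, 3) for w in w2])
```

With the same query repeated, the second step places less weight on the position that dominated the first step, which is the effect a coverage history is meant to have: already-read regions become less attractive, so attention moves on.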


Articles published on Attention-based Encoder-decoder

HINDIA: a deep-learning-based model for spell-checking of Hindi language

Dual Attention-Based Encoder-Decoder: A Customized Sequence-to-Sequence Learning for Soft Sensor Development.

Breathing Sound Segmentation and Detection Using Transfer Learning Techniques on an Attention-Based Encoder-Decoder Architecture.

Text Normalization Using Encoder–Decoder Networks Based on the Causal Feature Extractor

Spatial Relational Attention Using Fully Convolutional Networks for Image Caption Generation

Attention-Based Encoder-Decoder Model for Photovoltaic Power Generation Prediction

Multivariate time series forecasting via attention-based encoder–decoder framework

A Hybrid Short-Term Load Forecasting Framework with an Attention-Based Encoder–Decoder Network Based on Seasonal and Trend Adjustment

Pattern generation strategies for improving recognition of Handwritten Mathematical Expressions

RNN-LSTM-GRU based language transformation

Video Summarization With Attention-Based Encoder–Decoder Networks

An End-to-End Recognition System for Unconstrained Vietnamese Handwriting

Attention-Based Personalized Encoder-Decoder Model for Local Citation Recommendation.

Long short-term memory network with external memories for image caption generation

A Human-Inspired Recognition System for Pre-Modern Japanese Historical Documents

MindID

Modeling coverage with semantic embedding for image caption generation

Multi-Channel Encoder for Neural Machine Translation

Hybrid CTC/Attention Architecture for End-to-End Speech Recognition

Recent progress in deep end-to-end models for spoken language processing
