Recurrent Neural Network Language Model Research Articles

Recently, the hybrid convolutional neural network hidden Markov model (CNN-HMM) has been introduced for offline handwritten Chinese text recognition (HCTR) and has achieved state-of-the-art performance. However, modeling each of the large vocabulary of Chinese characters with a uniform and fixed number of hidden states requires high memory and computational costs and makes the tens of thousands of HMM state classes confusing. Another key issue of CNN-HMM for HCTR is the diversified writing style, which leads to model strain and a significant performance decline for specific writers. To address these issues, we propose a writer-aware CNN based on parsimonious HMM (WCNN-PHMM). First, PHMM is designed using a data-driven state-tying algorithm to greatly reduce the total number of HMM states, which not only yields a compact CNN by state sharing of the same or similar radicals among different Chinese characters but also improves the recognition accuracy due to the more accurate modeling of tied states and the lower confusion among them. Second, WCNN integrates each convolutional layer with one adaptive layer fed by a writer-dependent vector, namely, the writer code, to extract the irrelevant variability in writer information to improve recognition performance. The parameters of writer-adaptive layers are jointly optimized with other network parameters in the training stage, while a multiple-pass decoding strategy is adopted to learn the writer code and generate recognition results. Validated on the ICDAR 2013 competition of CASIA-HWDB database, the more compact WCNN-PHMM of a 7360-class vocabulary can achieve a relative character error rate (CER) reduction of 16.6% over the conventional CNN-HMM without considering language modeling. By adopting a powerful hybrid language model (N-gram language model and recurrent neural network language model), the CER of WCNN-PHMM is reduced to 3.17%. Moreover, the state-tying results of PHMM explicitly show the information sharing among similar characters and the confusion reduction of tied state classes. Finally, we visualize the learned writer codes and demonstrate the strong relationship with the writing styles of different writers. To the best of our knowledge, WCNN-PHMM yields the best results on the ICDAR 2013 competition set, demonstrating its power when enlarging the size of the character vocabulary.

Read full abstract

The performance of most error-correction (EC) algorithms that operate on genomics reads is dependent on the proper choice of its configuration parameters, such as the value of k in k-mer based techniques. In this work, we target the problem of finding the best values of these configuration parameters to optimize error correction and consequently improve genome assembly. We perform this in an adaptive manner, adapted to different datasets and to EC tools, due to the observation that different configuration parameters are optimal for different datasets, i.e., from different platforms and species, and vary with the EC algorithm being applied. We use language modeling techniques from the Natural Language Processing (NLP) domain in our algorithmic suite, Athena, to automatically tune the performance-sensitive configuration parameters. Through the use of N-Gram and Recurrent Neural Network (RNN) language modeling, we validate the intuition that the EC performance can be computed quantitatively and efficiently using the “perplexity” metric, repurposed from NLP. After training the language model, we show that the perplexity metric calculated from a sample of the test (or production) data has a strong negative correlation with the quality of error correction of erroneous NGS reads. Therefore, we use the perplexity metric to guide a hill climbing-based search, converging toward the best configuration parameter value. Our approach is suitable for both de novo and comparative sequencing (resequencing), eliminating the need for a reference genome to serve as the ground truth. We find that Athena can automatically find the optimal value of k with a very high accuracy for 7 real datasets and using 3 different k-mer based EC algorithms, Lighter, Blue, and Racer. The inverse relation between the perplexity metric and alignment rate exists under all our tested conditions—for real and synthetic datasets, for all kinds of sequencing errors (insertion, deletion, and substitution), and for high and low error rates. The absolute value of that correlation is at least 73%. In our experiments, the best value of k found by Athena achieves an alignment rate within 0.53% of the oracle best value of k found through brute force searching (i.e., scanning through the entire range of k values). Athena’s selected value of k lies within the top-3 best k values using N-Gram models and the top-5 best k values using RNN models With best parameter selection by Athena, the assembly quality (NG50) is improved by a Geometric Mean of 4.72X across the 7 real datasets.

Read full abstract

Recurrent Neural Network Language Model Research Articles

Related Topics

Articles published on Recurrent Neural Network Language Model

Educational Resource Material Translation System

Toward enriched decoding of mandarin spontaneous speech

Training RNN language models on uncertain ASR hypotheses in limited data scenarios

Survey Paper: Automatic Title Generation for Text with RNN and Pre-trained Transformer Language Model

RNN Language Processing Model-Driven Spoken Dialogue System Modeling Method.

Single-Stage Prediction Models Do Not Explain the Magnitude of Syntactic Disambiguation Difficulty.

Lexical Strata and Phonotactic Perplexity Minimization

G2Basy: A framework to improve the RNN language model and ease overfitting problem

Improving Amharic Speech Recognition System Using Connectionist Temporal Classification with Attention Model and Phoneme-Based Byte-Pair-Encodings

A Fuzzy-AHP-based Movie Recommendation System with the Bidirectional Recurrent Neural Network Language Model

Bidirectional Recurrent Neural Network Language Model: Cross Entropy Churn Metrics for Defect Prediction Modeling

A Single-Shot Generalized Device Placement for Large Dataflow Graphs

Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling

Assessment of Word-Level Neural Language Models for Sentence Completion

Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition

Athena: Automated Tuning of k-mer based Genomic Error Correction Algorithms using Language Models

A Public Chinese Dataset for Language Model Adaptation

Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition

Bidirectional Recurrent Neural Network Language Model: Cross Entropy Churn Metrics for Defect Prediction Modelling

Character n-Gram Embeddings to Improve RNN Language Models

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Recurrent Neural Network Language Model Research Articles

Related Topics

Articles published on Recurrent Neural Network Language Model

Educational Resource Material Translation System

Toward enriched decoding of mandarin spontaneous speech

Training RNN language models on uncertain ASR hypotheses in limited data scenarios

Survey Paper: Automatic Title Generation for Text with RNN and Pre-trained Transformer Language Model

RNN Language Processing Model-Driven Spoken Dialogue System Modeling Method.

Single-Stage Prediction Models Do Not Explain the Magnitude of Syntactic Disambiguation Difficulty.

Lexical Strata and Phonotactic Perplexity Minimization

G2Basy: A framework to improve the RNN language model and ease overfitting problem

Improving Amharic Speech Recognition System Using Connectionist Temporal Classification with Attention Model and Phoneme-Based Byte-Pair-Encodings

A Fuzzy-AHP-based Movie Recommendation System with the Bidirectional Recurrent Neural Network Language Model

Bidirectional Recurrent Neural Network Language Model: Cross Entropy Churn Metrics for Defect Prediction Modeling

A Single-Shot Generalized Device Placement for Large Dataflow Graphs

Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling

Assessment of Word-Level Neural Language Models for Sentence Completion

Writer-aware CNN for parsimonious HMM-based offline handwritten Chinese text recognition

Athena: Automated Tuning of k-mer based Genomic Error Correction Algorithms using Language Models

A Public Chinese Dataset for Language Model Adaptation

Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition

Bidirectional Recurrent Neural Network Language Model: Cross Entropy Churn Metrics for Defect Prediction Modelling

Character n-Gram Embeddings to Improve RNN Language Models