The Transformer is a deep learning model built on the self-attention mechanism and is widely used to solve sequence-to-sequence problems, including speech recognition. Since it was proposed, the Transformer has been developed substantially and has made great progress in the field of speech recognition. Speech recognition is a sequence-to-sequence task that transforms human speech into text. The Recurrent Neural Network (RNN) is another model that can be applied to speech recognition, and both the RNN and the Transformer use an encoder-decoder architecture to solve sequence-to-sequence problems. However, the RNN is a recurrent model that is weak in parallel training, and it does not perform as well on sequence-to-sequence tasks as the Transformer, which is non-recurrent. This paper mainly analyzes the accuracy of the Transformer and the RNN in automatic speech recognition. It shows that the Transformer performs better than the RNN in speech recognition, achieving higher accuracy, and it therefore provides evidence that the Transformer is an efficacious approach to automatic speech recognition as well as a practical substitute for traditional methods such as the RNN.
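To make the contrast with the recurrent model concrete, the following is a minimal sketch of single-head scaled dot-product self-attention, the core operation of the Transformer. It uses only NumPy; the array names, dimensions, and random inputs are illustrative assumptions, not details from this paper. The key point is that all time steps are processed in one matrix product, whereas an RNN must step through the sequence position by position.

```python
# Minimal sketch of scaled dot-product self-attention (single head),
# assuming NumPy only; all names and sizes here are illustrative.
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Compute self-attention for a sequence x of shape (seq_len, d_model).

    Unlike an RNN, every position attends to every other position in a
    single matrix product, so all time steps are handled in parallel.
    """
    q, k, v = x @ w_q, x @ w_k, x @ w_v           # queries, keys, values
    scores = q @ k.T / np.sqrt(k.shape[-1])       # scaled dot-product scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over key positions
    return weights @ v                            # weighted sum of value vectors

rng = np.random.default_rng(0)
seq_len, d_model = 5, 8                           # e.g. 5 acoustic frames
x = rng.standard_normal((seq_len, d_model))
w_q, w_k, w_v = (rng.standard_normal((d_model, d_model)) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)
print(out.shape)                                  # one output vector per frame
```

In a full Transformer encoder this operation is repeated with multiple heads and stacked with feed-forward layers, but even this sketch shows why training parallelizes so well compared with a recurrent model.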