Feature combination and stacking of recurrent and non-recurrent neural networks for LVCSR

Christian Plahl, Michael Kozielski, Hermann Ney, Ralf Schlüter

doi:10.1109/icassp.2013.6638961

Abstract

This paper investigates the combination of different short-term features and the combination of recurrent and non-recurrent neural networks (NNs) on a Spanish speech recognition task. Several methods exist to combine different feature sets, such as concatenation or linear discriminant analysis (LDA). Even though all these techniques achieve reasonable improvements, feature combination by multi-layer perceptrons (MLPs) outperforms all known approaches. We develop the concept of MLP-based feature combination further using recurrent neural networks (RNNs). The phoneme posterior estimates derived from an RNN lead to a significant improvement over the result of the MLPs and achieve a 5% relative improvement in word error rate (WER) with far fewer parameters. Moreover, we improve the system performance further by combining an MLP and an RNN in a hierarchical framework, in which the MLP benefits from the preprocessing of the RNN. All NNs are trained on phonemes; nevertheless, the same concepts could be applied using context-dependent states. In addition to the improvements in recognition performance w.r.t. WER, NN-based feature combination methods reduce both the training and the testing complexity. Overall, the systems are based on a single set of acoustic models, together with the training of different NNs.
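
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two feature streams, all layer sizes, the Elman-style recurrence, the random (untrained) weights, and the choice to feed the MLP with the combined features plus the RNN log-posteriors are assumptions made only to show how feature concatenation and hierarchical stacking fit together.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the last axis.
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def rnn_posteriors(x, w_in, w_rec, w_out):
    """Elman-style RNN forward pass: one posterior vector per frame."""
    h = np.zeros(w_rec.shape[0])
    out = []
    for frame in x:
        h = np.tanh(frame @ w_in + h @ w_rec)  # hidden state carries context
        out.append(softmax(h @ w_out))
    return np.stack(out)

def mlp_posteriors(x, w1, w2):
    """Single-hidden-layer MLP forward pass, applied frame by frame."""
    return softmax(np.tanh(x @ w1) @ w2)

rng = np.random.default_rng(0)
T, N_PHONEMES, HIDDEN = 50, 30, 64        # hypothetical sizes

# Two hypothetical short-term feature streams for the same T frames.
stream_a = rng.standard_normal((T, 16))
stream_b = rng.standard_normal((T, 12))

# Feature combination by concatenation: the NN sees both streams at once.
combined = np.concatenate([stream_a, stream_b], axis=1)   # shape (T, 28)

# First stage: RNN estimates phoneme posteriors from the combined features.
w_in = rng.standard_normal((combined.shape[1], HIDDEN)) * 0.1
w_rec = rng.standard_normal((HIDDEN, HIDDEN)) * 0.1
w_out = rng.standard_normal((HIDDEN, N_PHONEMES)) * 0.1
rnn_post = rnn_posteriors(combined, w_in, w_rec, w_out)

# Second stage (assumed layout): the MLP is stacked on the RNN output.
mlp_input = np.concatenate([combined, np.log(rnn_post)], axis=1)
w1 = rng.standard_normal((mlp_input.shape[1], HIDDEN)) * 0.1
w2 = rng.standard_normal((HIDDEN, N_PHONEMES)) * 0.1
mlp_post = mlp_posteriors(mlp_input, w1, w2)
```

Both `rnn_post` and `mlp_post` hold one distribution over the hypothetical 30 phoneme classes per frame. In a real system the networks would be trained on phoneme targets, and the resulting posteriors would then serve as features for the acoustic model.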
