Improvement of speech recognition results by a combination of systems

Rama Hasan, Sinan Khwandah, Pavlos Lazaridis, Hussein Hussein, Maximilian Eibl, Marc Ritter

doi:10.23919/iconac.2017.8082082

Abstract

This study proposes an algorithm that combines two speech recognition systems. Three systems are considered; they differ in the method used in the feature extraction stage but share the same classifier, a Hidden Markov Model (HMM). The first system uses Mel-Frequency Cepstral Coefficients (MFCC), the second uses Linear Prediction Cepstral Coefficients (LPCC), and the third uses Perceptual Linear Predictive (PLP) features. The combination algorithm is applied separately to each pair of systems. The study uses a data set consisting of four voice commands, “shutdown”, “documents”, “restart”, and “net”, pronounced by 33 people. In addition to improving the recognition rate for isolated words, the study aims to identify the most complementary pair of systems by examining two kinds of errors: simultaneous and dependent errors. Among the individual systems, the MFCC-based system achieved the highest recognition rate, 85.44%. The combined systems showed a noticeable improvement over the individual systems, with the MFCC & PLP combination achieving the highest recognition rate, 93.44%.
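The abstract does not define the two error types, so the sketch below rests on an assumed reading: a simultaneous error is an utterance that both systems misrecognize, and a dependent error is a simultaneous error where both systems output the same wrong word. The function names `error_overlap` and `rank_pairs` are hypothetical, and the labels are made-up toy data, not the paper's results. Under these assumptions, a pair with few shared errors would be a more promising candidate for combination.

```python
from itertools import combinations


def error_overlap(truth, pred_a, pred_b):
    """Count simultaneous and dependent errors for two systems.

    Assumed definitions (not taken from the paper):
    - simultaneous: both systems are wrong on the same utterance
    - dependent: both are wrong and output the same wrong label
    """
    simultaneous = 0
    dependent = 0
    for t, a, b in zip(truth, pred_a, pred_b):
        if a != t and b != t:
            simultaneous += 1
            if a == b:
                dependent += 1
    return simultaneous, dependent


def rank_pairs(truth, predictions):
    """Sort system pairs by fewest simultaneous, then fewest dependent errors."""
    rows = []
    for name_a, name_b in combinations(sorted(predictions), 2):
        sim, dep = error_overlap(truth, predictions[name_a], predictions[name_b])
        rows.append((sim, dep, name_a, name_b))
    return sorted(rows)


# Made-up toy labels for illustration only; these are not the paper's data.
truth = ["shutdown", "documents", "restart", "net", "net", "restart"]
predictions = {
    "MFCC": ["shutdown", "documents", "net", "net", "net", "restart"],
    "LPCC": ["shutdown", "restart", "net", "net", "restart", "restart"],
    "PLP": ["shutdown", "documents", "restart", "net", "restart", "documents"],
}

ranking = rank_pairs(truth, predictions)
for sim, dep, name_a, name_b in ranking:
    print(f"{name_a} & {name_b}: simultaneous={sim}, dependent={dep}")
```

Sorting by simultaneous errors first reflects the idea that a combination rule can only recover an utterance if at least one of the two systems recognized it correctly; the actual combination algorithm used in the study is not described in the abstract and is not reproduced here.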
