Speech Recognition Using Cross Correlation and Feature Analysis Using Mel-Frequency Cepstral Coefficients and Pitch

Ruchi Gupte, Reena Sonkusare, Sarah Hawa

doi:10.1109/inocon50539.2020.9298320

Abstract

Speech recognition systems are widely used in telephony, smartphones, security, and home automation, where the system must identify and recognise an individual's voice before executing the next set of instructions. In this paper, we develop a method to identify both the speaker and the spoken word from basic audio samples of isolated words using cross-correlation in MATLAB. The method is a computationally inexpensive speech recognition and speaker identification technique that does not rely on complex speech algorithms or trained models. Speaker identification and speech recognition are performed on a specified set of word instructions in order to distinguish and analyse not only the words but also the speech pattern unique to an individual. After the audio signal is cleaned to remove noise, it is analysed using cross-correlation, and other speech parameters such as Mel-Frequency Cepstral Coefficients (MFCCs) and pitch are extracted. Results show an overall accuracy of 92% for a set of 5 command words and 2 unique individuals using only 10 training samples. Audio signals of the same word recorded by the same person are also found to have a significantly greater degree of correlation than other audio signals, so cross-correlation can form the basis of a valid speech recognition method for small-scale applications where processing capability is low.
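The matching idea in the abstract can be illustrated with a minimal sketch. The paper's work is done in MATLAB and its exact procedure is not given here, so the following is only an assumed illustration in plain Python: an incoming signal is compared against stored reference recordings by the peak of the normalized cross-correlation, and the label with the highest average peak is chosen. The function names `normalized_xcorr_peak` and `classify`, the mean-removal step, and the averaging rule are hypothetical choices for this example, not the authors' implementation.

```python
import math


def normalized_xcorr_peak(x, y):
    """Peak of the normalized cross-correlation of x and y over all lags.

    Both signals are mean-removed first (an assumption of this sketch).
    The result lies in [-1, 1]; identical signals score close to 1.
    """
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    x = [v - mean_x for v in x]
    y = [v - mean_y for v in y]

    # Normalize by the signal energies so loudness does not affect the score.
    norm = math.sqrt(sum(v * v for v in x) * sum(v * v for v in y))
    if norm == 0.0:
        return 0.0

    best = -1.0
    # Slide y across x; the lag is the offset of y relative to x.
    for lag in range(-(len(y) - 1), len(x)):
        acc = 0.0
        for i, y_val in enumerate(y):
            j = i + lag
            if 0 <= j < len(x):
                acc += x[j] * y_val
        best = max(best, acc / norm)
    return best


def classify(sample, templates):
    """Pick the label whose reference recordings correlate best with sample.

    templates maps a label (hypothetical, e.g. a command word) to a list of
    reference signals; the score is the mean correlation peak per label.
    """
    scores = {
        label: sum(normalized_xcorr_peak(sample, ref) for ref in refs) / len(refs)
        for label, refs in templates.items()
    }
    return max(scores, key=scores.get), scores
```

With a handful of reference recordings per word and per speaker, `classify` would be called once per incoming utterance. The direct double loop costs time proportional to the product of the two signal lengths, which is acceptable for short isolated words but would usually be replaced by an FFT-based correlation for longer signals.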

Similar Papers

Genetic Algorithm for Combined Speaker and Speech Recognition using Deep Neural Networks
Gurpreet Kaur ... Amod Kumar
Journal of Telecommunications and Information Technology | VOL. 2
29 Jun 2018

Mel Frequency Cepstral Coefficients (MFCC) based speaker identification in noisy environment using wiener filter
Paresh M Chauhan ... Nikita P Desai
01 Mar 2014

Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
Md Jahangir Alam ... Douglas O’Shaughnessy
Cognitive Computation | VOL. 5
07 Dec 2012

Speaker and Speech Recognition Using Wavelets (Αναγνώριση ομιλητή και ομιλίας με χρήση κυματιδίων)
Μιχάλης Σιαφαρίκας
01 Jan 2009
