Comparison of Three Auditory Frequency Scales in Feature Extraction on Myanmar Digits Recognition

Hay Mar Soe Naing,Risanuri Hidayat,Bondhan Winduratna,Yoshikazu Miyanaga

doi:10.1109/iciteed.2018.8534768

Abstract

With the rapidly growth of digital computers, there has been an increasing demand to communicate with machines in efficient spoken manner. Speech recognition is the process of translating fromspoken words into readable text. To get the robust and reliable transcription text from recognizer,proper feature extraction methods are needed. This paper is concerned to an approach of features extraction on spoken Myanmar digits recognition. In this study, the recognition performances of Fast Fourier Transform (FFT), Mel Frequency Cepstrum Coefficients (MFCC), Linear Predictive Coding (LPC)and Linear Prediction Cepstral Coefficients (LPCC) methods will be compared. Even though the frequency spacing with Mel scale is extensively used in Automatic Speech Recognition (ASR), this paper demonstrates another scale of auditory frequency spectrum namely, Bark and Equivalent Rectangular Bandwidth (ERB) scales. The results have achieved the better performance than the Mel scale. The k-Nearest Neighbor (KNN) is employed as the classifier and ten digits of Myanmar language from twelve speakers are collected. According to these experiments, the results show the best recognition rates of 88.6% with the used of feature extraction based on ERB scale band pass filter.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison of Three Auditory Frequency Scales in Feature Extraction on Myanmar Digits Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Real-time prediction of upcoming respiratory events via machine learning using snoring sound signal.
Bochun Wang ... Ji Wu
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 17
Bochun Wang, et. al.Bochun Wang ... Ji Wu
12 Apr 2021
Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine | VOL. 17

Improved linear predictive coding method for speech recognition
Jiang Hai ... Er Meng Joo
-
Jiang Hai, et. al. Jiang Hai ... Er Meng Joo
15 Dec 2003
15 Dec 2003

A comparative study on feature dependency of the Manipuri language based phonetic engine
Sushanta Kabir Dutta ... Salam Nandakishor
-
Sushanta Kabir Dutta, et. al.Sushanta Kabir Dutta ... Salam Nandakishor
01 Apr 2017
01 Apr 2017

Comparison of DTW and HMM for isolated word recognition
Sharada C Sajjan ... C Vijaya
-
Sharada C Sajjan, et. al.Sharada C Sajjan ... C Vijaya
01 Mar 2012
01 Mar 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of Three Auditory Frequency Scales in Feature Extraction on Myanmar Digits Recognition

Abstract

Talk to us

Similar Papers