&lt;b&gt;A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English&lt;/b&gt; - doi: 10.4025/actascitechnol.v35i4.19825

Diego Furtado Silva,Gustavo Enrique Almeida Prado Alves Batista,Vinícius Mourão Alves De Souza

doi:10.4025/actascitechnol.v35i4.19825

<b>A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English</b> - doi: 10.4025/actascitechnol.v35i4.19825

Diego Furtado Silva, Gustavo Enrique Almeida Prado Alves Batista + Show 1 more

Open Access

https://doi.org/10.4025/actascitechnol.v35i4.19825

Copy DOI

Journal: Acta Scientiarum. Technology	Publication Date: Oct 7, 2013
Citations: 12	License type: cc-by

Affiliation: Universidade de São Paulo

Abstract

Recognition of isolated spoken digits is the core procedure for a large number of applications which rely solely on speech for data exchange, as in telephone-based services, such as dialing, airline reservation, bank transaction and price quotation. Spoken digit recognition is generally a challenging task since the signals last for a short period of time and often some digits are acoustically very similar to other digits. The objective of this paper is to investigate the use of machine learning algorithms for spoken digit recognition and disclose the free availability of a database with digits pronounced in English and Portuguese to the scientific community. Since machine learning algorithms are fully dependent on predictive attributes to build precise classifiers, we believe that the most important task for successfully recognizing spoken digits is feature extraction. In this work, we show that Line Spectral Frequencies (LSF) provide a set of highly predictive coefficients. We evaluated our classifiers in different settings by altering the sampling rate to simulate low quality channels and varying the number of coefficients.

Highlights

In the last decades, research on speech and speaker recognition has attracted an enormous amount of attention, mainly due to the increasing number of applications such as biometric authentication, in which a user's voice is used to allow or deny access to a system; and accessibility, in which a user is able to control equipment or navigate the Internet using speech; facilitating these tasks to physically impaired people.An important speech recognition application, especially useful for telephone service providers, isActa Scientiarum
- We provide a wider set of experimental settings with different number of Mel-Frequency Cepstrum Coefficients (MFCC) and Line Spectral Frequencies (LSF) coefficients
Our results show that Line Spectral Frequencies (LSF) provide a set of highly predictive coefficients for digit recognition

Summary

Introduction

Research on speech and speaker recognition has attracted an enormous amount of attention, mainly due to the increasing number of applications such as biometric authentication, in which a user's voice is used to allow or deny access to a system; and accessibility, in which a user is able to control equipment or navigate the Internet using speech; facilitating these tasks to physically impaired people.An important speech recognition application, especially useful for telephone service providers, isActa Scientiarum. Research on speech and speaker recognition has attracted an enormous amount of attention, mainly due to the increasing number of applications such as biometric authentication, in which a user's voice is used to allow or deny access to a system; and accessibility, in which a user is able to control equipment or navigate the Internet using speech; facilitating these tasks to physically impaired people. An important speech recognition application, especially useful for telephone service providers, is. Companies make their services user-friendlier compared with entering numbers on the telephone keypad. This is even more evident when the procedure is done through mobile devices, in which there are no physically detached keyboards for dialing

Objectives

Methods

Results

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

<b>A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English</b> - doi: 10.4025/actascitechnol.v35i4.19825

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Acta Scientiarum. Technology

Lead the way for us

Similar Papers

Spoken Digit Recognition in Portuguese Using Line Spectral Frequencies
Diego F. Silva ... Vinícius M. A. de Souza
-
Diego F. Silva, et. al.Diego F. Silva ... Vinícius M. A. de Souza
01 Jan 2012
01 Jan 2012

The use of machine learning algorithms in recommender systems: A systematic review
Ivens Portugal ... Donald Cowan
Expert Systems with Applications | VOL. 97
Ivens Portugal, et. al.Ivens Portugal ... Donald Cowan
09 Dec 2017
Expert Systems with Applications | VOL. 97

Untersuchungen über die photosynthetische Leistung gelbblättriger Gehölze )
Klaus Michael
Flora oder Allgemeine Botanische Zeitung | VOL. 141
Klaus MichaelKlaus Michael
01 Jan 1953
Flora oder Allgemeine Botanische Zeitung | VOL. 141

Evaluation of machine learning algorithms for fast video transcoding in streaming services
Thiago Bubolz ... Guilherme Correa
-
Thiago Bubolz, et. al.Thiago Bubolz ... Guilherme Correa
29 Oct 2019
29 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

&lt;b&gt;A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English&lt;/b&gt; - doi: 10.4025/actascitechnol.v35i4.19825

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Acta Scientiarum. Technology

<b>A comparative study between MFCC and LSF coefficients in automatic recognition of isolated digits pronounced in Portuguese and English</b> - doi: 10.4025/actascitechnol.v35i4.19825