Computer-implemented methods and systems for modeling and recognition of speech

Marios Athineos,Daniel P W Ellis

doi:10.1121/1.3455415

Computer-implemented methods and systems for modeling and recognition of speech

Marios Athineos, Daniel P W Ellis

https://doi.org/10.1121/1.3455415

Copy DOI

Journal: The Journal of The Acoustical Society of America

Publication Date: Jan 1, 2010

#Frequency Domain Representation #Frequency Domain Linear Prediction + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In accordance with the present invention, computer implemented methods and systems are provided for representing and modeling the temporal structure of audio signals. In response to receiving a signal, a time-to-frequency domain transformation on at least a portion of the received signal to generate a frequency domain representation is performed. The time-to-frequency domain transformation converts the signal from a time domain representation to the frequency domain representation. A frequency domain linear prediction (FDLP) is performed on the frequency domain representation to estimate a temporal envelope of the frequency domain representation. Based on the temporal envelope, one or more speech features are generated.

Full Text