Method and apparatus for extracting speech related facial features for use in speech recognition systems

David G Stork

doi:10.1121/1.426998

Method and apparatus for extracting speech related facial features for use in speech recognition systems

David G Stork

https://doi.org/10.1121/1.426998

Copy DOI

Journal: The Journal of the Acoustical Society of America

Publication Date: Jan 1, 1999

#Use In Speech Recognition Systems #Visual Data + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The apparatus for the recognition of speech comprises an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual processor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: The Journal of the Acoustical Society of America

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.