Abstract

This paper describes an articulatory speech production model trained on an X-ray microbeam database, and presents results of using the model within a speech recognition framework. The system uses an explicit statistical model of co-articulation to increase the accuracy of articulator trajectories synthesized from time-aligned phonetic strings, as compared with X-ray traces. From these trajectories, spectral vector probability distributions are generated using a set of artificial neural networks. The production model is then used in combination with a hidden Markov model recognition system to re-score N-best utterance transcription lists. Relative reductions in the word error rate of between 11% and 18% are achieved on a small recognition task.
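As an illustrative sketch only (not the paper's implementation), N-best re-scoring of the kind described above is often realized as a weighted combination of the baseline HMM score with the secondary model's log-likelihood; the interpolation weight `alpha` and the function names here are hypothetical:

```python
# Sketch of N-best re-scoring: linearly combine HMM log-scores with
# log-likelihoods from a second model (e.g. an articulatory production
# model) and re-rank the hypothesis list. `alpha` is a hypothetical
# interpolation weight, typically tuned on held-out data.

def rescore_nbest(nbest, alpha=0.5):
    """nbest: list of (transcription, hmm_logprob, production_logprob).
    Returns the list re-ranked by the combined score, best first."""
    def combined(entry):
        _, hmm_lp, prod_lp = entry
        return (1.0 - alpha) * hmm_lp + alpha * prod_lp
    return sorted(nbest, key=combined, reverse=True)

# Usage: the production model's score promotes hypothesis "b" past "a",
# even though "a" had the higher HMM score.
hyps = [("a", -10.0, -30.0), ("b", -12.0, -20.0)]
reranked = rescore_nbest(hyps, alpha=0.5)
# reranked[0][0] == "b"
```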
