Speech recognition system and method employing data compression

Xuedong Huang,Shenzi Zhang

doi:10.1121/1.420248

Abstract

A data compression system greatly compresses the stored data used by a speech recognition system employing hidden Markov models (HMM). The speech recognition system vector quantizes the acoustic space spoken by humans by dividing it into a predetermined number of acoustic features that are stored as codewords in a vector quantization (output probability) table or codebook. For each spoken word, the speech recognition system calculates an output probability value for each codeword, the output probability value representing an estimated probability that the word will be spoken using the acoustic feature associated with the codeword. The probability values are stored in an output probability table indexed by each codeword and by each word in a vocabulary. The output probability table is arranged to allow compression of the probability of values associated with each codeword based on other probability values associated with the same codeword, thereby compressing the stored output probability. By compressing the probability values associated with each codeword separate from the probability values associated with other codewords, the speech recognition system can recognize spoken words without having to decompress the entire output probability table. In a preferred embodiment, additional compression is achieved by quantizing the probability values into 16 buckets with an equal number of probability values in each bucket. By quantizing the probability values into buckets, additional redundancy is added to the output probability table, which allows the output probability table to be additionally compressed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech recognition system and method employing data compression

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: Jan 1, 1997
Citations: 1

Similar Papers

Discrete-Mixture HMMs-based Approach for Noisy Speech Recognition
Tetsuo Kosaka ... Masaki Koh
-
Tetsuo Kosaka, et. al.Tetsuo Kosaka ... Masaki Koh
01 Jun 2007
01 Jun 2007

Method and system for speech recognition using continuous density hidden Markov models
Xuedong D Huang ... Milind V Mahajan
The Journal of the Acoustical Society of America | VOL. 107
Xuedong D Huang, et. al.Xuedong D Huang ... Milind V Mahajan
01 Jan 1999
The Journal of the Acoustical Society of America | VOL. 107

System and method for speech recognition and transcription
Dung H Ky
The Journal of the Acoustical Society of America | VOL. 121
Dung H KyDung H Ky
01 Jan 2007
The Journal of the Acoustical Society of America | VOL. 121

An improved VQ codebook design algorithm for HMM
J.M Koo ... H.S Lee
-
J.M Koo, et. al.J.M Koo ... H.S Lee
01 Jan 1992
01 Jan 1992

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech recognition system and method employing data compression

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America