MDL-based context-dependent subword modeling for speech recognition.

Koichi Shinoda, Takao Watanabe

doi:10.1250/ast.21.79

Open Access

https://doi.org/10.1250/ast.21.79

Journal: Journal of the Acoustical Society of Japan (E)
Publication Date: Jan 1, 2000
Citations: 236
License type: free

Affiliation: NEC (Japan)

#Hidden Markov Models #Minimum Description Length

Abstract

Context-dependent phone units, such as triphones, have recently come to be used to model subword units in speech recognition systems that are based on the use of hidden Markov models (HMMs). While most such systems employ clustering of the HMM parameters (e.g., subword clustering and state clustering) to control the HMM size, so as to avoid poor recognition accuracy due to a lack of training data, none of them provide any effective criteria for determining the optimal number of clusters. This paper proposes a method in which state clustering is accomplished by way of phonetic decision trees and in which the minimum description length (MDL) criterion is used to optimize the number of clusters. Large-vocabulary Japanese-language recognition experiments show that this method achieves higher accuracy than the maximum-likelihood approach.
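
The abstract gives no formulas, so the following is only a minimal sketch of how an MDL stopping rule for phonetic decision-tree state clustering could look. It is not the authors' implementation. It assumes each tree node is modeled by a single diagonal-covariance Gaussian fitted by maximum likelihood, that each triphone state is summarized by sufficient statistics (count, sum, sum of squares), and that the description length takes the common two-part form of negative log-likelihood plus (number of parameters / 2) × log(total sample count). The names `pool`, `neg_loglik`, `mdl_split_delta`, `cluster`, `VAR_FLOOR`, the item layout, and the toy data are all hypothetical.

```python
import math

VAR_FLOOR = 1e-4  # assumed variance floor; not taken from the paper


def pool(items):
    """Sum the sufficient statistics (count, sum, sum of squares) over items."""
    dim = len(items[0]["sum"])
    n = sum(it["n"] for it in items)
    s = [sum(it["sum"][d] for it in items) for d in range(dim)]
    ss = [sum(it["sumsq"][d] for it in items) for d in range(dim)]
    return n, s, ss


def neg_loglik(items):
    """Negative log-likelihood of the pooled data under one ML-fitted
    diagonal Gaussian (exact unless the variance floor is applied)."""
    n, s, ss = pool(items)
    logdet = sum(math.log(max(q / n - (m / n) ** 2, VAR_FLOOR))
                 for m, q in zip(s, ss))
    return 0.5 * n * (len(s) * (1.0 + math.log(2.0 * math.pi)) + logdet)


def mdl_split_delta(items, question, total_count):
    """Change in description length if `items` is split by `question`.
    A split adds one leaf, i.e. 2*dim parameters (mean and variance),
    which costs 0.5 * 2*dim * log(total_count) under the assumed penalty."""
    yes = [it for it in items if question(it["context"])]
    no = [it for it in items if not question(it["context"])]
    if not yes or not no:
        return math.inf, yes, no
    dim = len(items[0]["sum"])
    delta = neg_loglik(yes) + neg_loglik(no) - neg_loglik(items)
    delta += dim * math.log(total_count)
    return delta, yes, no


def cluster(items, questions):
    """Split each node by its best question while that split shortens
    the description length; return the resulting leaves (clusters)."""
    total = sum(it["n"] for it in items)
    leaves, stack = [], [items]
    while stack:
        node = stack.pop()
        best = min((mdl_split_delta(node, q, total) for q in questions),
                   key=lambda r: r[0], default=(math.inf, [], []))
        if best[0] < 0:
            stack.extend([best[1], best[2]])
        else:
            leaves.append(node)
    return leaves


def make_item(left, mean, var=1.0, n=100):
    """Toy one-dimensional state statistics for a given left context."""
    return {"context": {"left": left}, "n": n,
            "sum": [n * mean], "sumsq": [n * (var + mean ** 2)]}


# Hypothetical toy data: two vowel contexts and two consonant contexts.
items = [make_item("a", 0.0), make_item("i", 0.1),
         make_item("k", 5.0), make_item("t", 5.1)]
questions = [lambda c: c["left"] in "aiueo",  # is the left context a vowel?
             lambda c: c["left"] == "a"]      # is the left context /a/?
leaves = cluster(items, questions)
print([[it["context"]["left"] for it in leaf] for leaf in leaves])
```

On this toy data the vowel question is accepted at the root, while the further split of /a/ from /i/ is rejected: it improves the likelihood slightly, but by less than the penalty for the extra parameters. Likelihood alone never gets worse when a node is split, so a purely likelihood-based procedure needs an externally chosen stopping threshold, whereas the penalty term here sets the number of clusters without one.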
