Abstract

AbstractThis paper describes an automatic text‐ and speaker‐independent language identification method based on hidden Markov models (HMMs) for acoustic features. The hidden Markov modeling is used to represent the phonotactics for each language. Each language has its proper phonotactics.The HMM topology here is a fully structured (ergodic) model wherein any state could transit to all states. Two kinds of HMMs are used: the discrete HMM (DHMM) with the codebook and the continuous density HMM (CHMM). The HMM was trained using both the Baum‐Welch (forward‐backward) algorithm and the Viterbi algorithm. The latter was used for emphasizing the state transition probability.For comparison, experiments also were conducted on the identification using a mixtured Gaussian distribution model with one state. This single‐state Gaussian distribution model gave the same performance as the HMM trained with the Baum‐Welch algorithm. This is because the transition between states which reflects the characteristics of each language does not affect the likelihood scores very much. This problem was addressed by emphasizing the transition probabilities and using the Viterbi algorithm, which resulted in an improvement in the recognition rates. The trigram for optimal state sequence is introduced. Combining it with the HMM produced the best results.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.