Abstract

AbstractThe problem in which the input signal sequence is segmented into multiple signal sources and the signal source is estimated appear in a wide range of problems, such as speech information processing and language processing.In this paper, this kind of problem is called the unknown‐multiple signal source clustering problem, and a solution method is proposed based on the ergodic HMM. In ergodic HMM, the state corresponds to the signal source and the symbol sequence output from the state corresponds to the signal sequence. Then, using the Viterbi decoding and the forward decoding, the segmentation point and the category can simultaneously be estimated. As an application of the problem, the classification of the utterances by multiple speakers is attempted. As a result of the experiment, it is shown that the initial parameter values are important in the ergodic HMM, and the LPC cepstrum with a long‐term window is useful as the feature vector reflecting the speaker individuality.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.