Abstract

This paper presents the development of Thai text dependent speaker identification system by applying two feature-feeding approaches. A well-known multilayer perceptron (MLP) network with backpropagation learning algorithm is chosen. It has fast processing time and good performance for pattern recognition problems. But MLP has a limitation in that a network must have a fixed amount of input nodes. Therefore, the linear interpolation time normalization is chosen to adjust the input speech signal into a fixed size of input vector. Furthermore, the windowing technique is developed to avoid the distortion caused by a time normalization process. A fixed size window is sliced through the preprocessed features with fixed amount of overlapping frames. The high identification rate observed in experiments confirms that the developed windowing is suitable for the proposed Thai text-dependent speaker identification system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.