Abstract

Linear predictive coding (LPC) has been used to compress and encode speech signals for digital transmission at a low bit rate. PARCOR parameter associated with LPC that represents a vocal tract model based on a lattice filter structure is considered for speech recognition. The use of FIR coefficients and the frequency response of AR model were previously investigated. This paper reports a method to detect syllables from a continuous stream of speech. The system being developed slides a time window of 20 ms and calculates the PARCOR parameters continuously, feeding them to a syllable classifier. The syllable classifier is a supervised classifier that requires training. The training uses TIMIT speech database, which contains the recordings of 630 speakers of 8 major dialects of American English. The voiced/unvoiced switch built into the LPC vocoder was modified to segment words included in the speech records. Preliminary results of classification are presented in the paper

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.